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ABSTRACT 


A discrete state, discrete time Markov-type manpower flow 
model having a two-dimensional state space (i.e., a two 
characteristic model) is analyzed. The probabilistic proper- 
ties of the model and the equaticns of stocks and flows are 
developed. A new method of representing the stocks as a sum 
of steady-state and transient components is presented. Two 
specific applications of the model to multi-grade hierarchical 
systems in which the dimensions of the state space are (grade, 
length of service) and (grade, time in grade) are analyzed in 
detail. The problem of combining states across the second 
dimension of the state space is studieä and methods are 
derived for the (usual) case where the states are not lumpable. 
Finally, some applications of the model with state space 
(grade, skill group) to retraining problems between various 


skill categories are presented. 
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I. INTRODUCTION 


This paper deals with manpower flow models, specifically 
discrete state, discrete time Markov-type models in which 
the states are described by two characteristics. This type 
of manpower flow model will be referred to as a "two- 
Characteristic model" to distinguish it from the more common 
model in which the states are described by one characteristic. 

Markov-type flow models have been used for some years; 
see for example Young and Almond (1961), Blumen, Kogan and 
McCarthy (1955), Marshall and Oliver (1970) and Rowland and 
Sovereign (1969). Theoretical properties of the model are 
developed in Bartholomew (1967). Charnes, Cooper and Niehaus 
(1972) use the Markov-type flow model as part of a larger 
manpower ers model. 

The latter application is typical in that the Markov-type 
flow model is not used in isolation to project stocks and 
flows of people, but rather it is used to provide information 
to a more extensive model that deals with budgets, capital 
investments and aggregate work planning. The simple mathe- 
matical structure and the computational tractability of 
Markov-type models make it practical to embed them in larger 
models. 

Early Markov-type models had one-characteristic states, 
such as pay grade or status in a hierarchical system. 
Geometric lifetimes in these states were thus implied by the 


model structure and often could not be justified. However, 





the one-characteristic model usually led to a small state 
Space which implied little computational difficulty. The 
Simple mathematical structure of these models was also 
appealing, and analytical results were obtainable. 

In an attempt to overcome the shortcomings of these 
Simple models subsequent models in manpower planning used 
more than one characteristic to describe the states of the 
system. See for example the U.S. Navy ADSTAP models (U.S. 
Navy, 1973) and the U.S. Air Force TOPLINE model (RAND, 1973). 
In the documentation we have seen these have not been formally 
recognized as Markov-type models. Each model has been treated 
aS an individual case. By viewing these models as special 
cases of a more general two-characteristic Markov-type model, 
we present in this thesis a unified treatment of such a medel. 
We are able to develop the theory of its structure and we use 
this to analyze special cases frequently applied in military 
manpower planning. Some new applications of the model to the 
problem of retraining between skill groups in an organization 


are presented. 


A. GENERAL DESCRIPTION 

A graded manpower system is one in which each person is 
assigned a "grade," e.g., a pay grade, a rank or a status 
label. Such a system may be analyzed using a Markov-type 
model in which the states are defined to be the grades. 
Thus a state is identified by a single characteristic and we 
say the model has a one-dimensional state space. The analysis 


of a graded manpower system by such models is often suspect 





because the grades do not provide a state space within which 
the system is actually Markovian. For example those ina 
given grade who progress rapidly through the lower grades may 
have significantly different promotion prospects from those 
in the same grade whose progress was less rapid. Thus the 
future progress of an individual ina state depends on how 
he reached the given state. 
As is frequently the case in Markov models, expanding 

the state space may lead to a model of the system in which 
the Markov assumption is more realistic. In this paper the 
state space of a Markov-type model of a graded manpower sys- 
tem is expanded to one in which each state is identified by 
a couple (i,j) where i represents a grade and j represents 
es econd characteristic. Examples of a seceme characteristic 
are: 

1) Length of service in the system (LOS) 

2) Time in grade (TIG) 

3) Skill category 

4) Overall performance mark 

5) Physical location. 
In cases 1) and 2) the second characteristic is chosen 
primarily to make the Markov assumption more tenable. In the 
other cases the appropriate second characteristic is chosen 
to fit the problem at hand. 

Expanding the state space of a Markov-type model presents 

some difficulties, among which are: 

1) There are more parameters (transition probabilities) 


in the model to be estimated or controlled. 
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This paper does not deal with the problems of parameter 
estimation. In practice the estimation of transition proba- 
bilities in manpower systems is hindered more by lack of 
data than by questions regarding appropriate statistical 
techniques. More pertinent is the number of parameters to 
be controlled, because the models analyzed here are intended 
for planning rather than forecasting. That is, we are inter- 
ested in comparing the results of ee policies rather 
than projecting the consequences of some "state of nature." 
Increasing the number of parameters to be controlled in a 
planning model can lead to practical difficulties. Such 
difficulties are not investigated in a comprehensive fashion 
in this paper; however, in Section D of Chapter IV and 
Sections C and D of Chapter VII we demonstrate heds which 
effectively reduce the number of parameters the planner must 
control. 

2) Computations may be less tractable, if not completely 

impractical. 

The manpower flow model, is typically a component of a larger 
planning model, so it is to be expected that a single run of 
the planning model will require numerous calls for information 
from the flow model. Consider a system in which two grades i 
and k are important plateaus to be reached which affect an 
individual's retirement benefits. A retirement policy planning 
model might require information on the fraction of people who 
will attain grade k given that grade i is attained as well as 
the distribution of time required to reach grade k. Practical 


methods for deriving this type of information from a 


17 





two-characteristic model are developed in Chapters II and III. 
Although a two-characteristic model in a military manpower 
application would have at least one hundred states, it is 
shown that much information of interest can be derived by 
calculations restricted to a smaller number of states. 

3) Analytical results which yield insight into the 
properties of the system may be more difficult to 
obtain. | 

Certainly analytical results are more difficult to obtain in 
a two-characteristic model than da a one-characteristic model. 
Nevertheless, we obtain a number of results which include the 
structure of the fundamental matrix (Chapter II), the equi- 
librium distribution of people in the system under various 
hiring policies (Chapter III), the effect of average time 
spent in a grade on the equilibrium distribution of people in 
that grade (Chapter VI) and the relation between promotion 


rates and the number in a grade (Chapter VII). 


B. CONTENTS AND SUMMARY 

Chapter II begins with the definition of basic notation 
and the statement of assumptions. Probabilistic properties 
of the transient matrix of the two-characteristic model, such 
as the expected number of visits to a state and the expected 
time spent in a građe are developed. A not uncommon property 
Of manpower flow mođels, the 0-1 visiting property" (i.e., 
each state can be visited no more than once), is defined and 
Pes simplications discussed. The "no return property" (i.e., 


a transition into a state is impossible once a transition has 
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been made out of it) is defined. If the states in a grade 
have the no return property then it may be assumed without 
loss of generality that the fundamental matrix for that 
grade is upper triangular and thus is relatively easy to 
compute. Matrices of the probabilities of going from one 
state to another in t steps (t-step transition matrices) are 
defined and a recursive formula for their calculation is 
shown. The t-step transition matrices are subsequently used 
in Chapter III to represent the stocks as the sum of steady- 
state and transient components. In the last section of 
Chapter II is it shown how the basic probabilistic properties 
of the transient matrix of the two-characteristic model are 
developed when one conditions on the attainment of a higher 
grade. The resuits of Chapter II follow closely the results 
of Kemeny and Snell (1960), but extend them to the two- 
characteristic model. Thus Chapter II contains a compre- 
hensive treatment of how probabilistic information may be 
obtained from a two-characteristic model, and such a treat- 
ment has not been previously published. 

Chapter III begins with a definition of the timing 
convention. In real systems people enter, leave and change 
grades or skills continuously. In order to fit a Markov- 
type model in discrete time the notion of a "period" must 
be carefully defined and a consistent counting convention 
used. Since planners almost invariably use discrete planning 
periods, these conventions lead to useable results. We 
assume time is divided into discrete periods (for example, 


years). Each period is denoted by the integer value of time 
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at the end of the period. People are counted only at the 
last instant of a period. Various stock and flow vectors 
are defined; the term stock refers to the number of people 
in a state or set of states at the end of a period, the 
term flow refers to the number of people who make transitions 
from one state in one period to another state in the next 
period. In Section B we develop equations for computing the 
expected stocks at time t from the values of stocks and flows 
prior to time t. Section C contains the development of a new 
method for expressing the stock vector as a sum of a "steady- 
state" component and a "transient" component. The steady- 
state component of the stock vector is derived for the cases 
where external flows are 

1) Constant 

2) Growing linearly, 

3) Growing geometrically. 

Chapter IV deals with the two-characteristic model where 
length of service (LOS) is the second characteristic. This 
model is called the (grade, LOS) model. The chapter begins 
with background material showing how practical considerations 
might lead the manpower planner to use the (grade, LOS) model. 
In Section B we present definitions and display some of the 
matrices associated with the (grade, LOS) model in order to 
show their rather simple structure. The concept of a cohort 
(a group of people who enter the system at the same time and 
in the same state) is discussed. It is noted that when entry 
to the system is restricted to a single state, the (grade, 


LOS) model preserves cohorts in the sense that members of 
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different cohorts never simultaneously occupy the same state. 
Section C discusses computationally efficient methods for 
performing the matrix multiplications needed to derive 
information from the (grade, LOS) model. The discussion in 
this section depends on Appendix B which treats the multi- 
plication of "diagonal matrices." Section D shows how the 
structure of the (grade, LOS) model might be exploited in 
solving a problem regarding the minimization of the costs of 
reenlistment (in a military organization). In the last 
section of Chapter III, Section E, we discuss how the 
modeler may treat grade as the second characteristic and LOS 
as the first characteristic. Such an interchanged model is 
called the (LOS, grade) model. It is shown that the (LOS, 
grade) model ea flexible than the (grade, LOS) model. 
Chapter V deals with the two-characteristic model where 
time in grade (TIG) is the second characteristic. This model 
is called the (grade, TIG) model. The chapter begins with a 
brief discussion of conditions under which the model might be 
appropriate. In Section B the definitions are stated and the 
y 
structure of various matrices associated with the (grade, 
TIG) model is displayed and discussed. It is noted that the 
(grade, TIG) model is equivalent to a discrete semi-Markov 
process. In Section C we discuss how the modeler may treat 
grade as the second characteristic and ‚TIG as the first 
characteristic. The resulting model, the (TIG, grade) model, 
is different in structure from the two-characteristic models 
previously studied. However it is briefly shown how one 


might derive information from Such a model. 
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Chapter VI treats the problem of combining states in a 
two-characteristic model. The problem is of practical 
importance because often there is more than one reasonable 
choice of the second characteristic to consider, and these 
would lead to higher dimensional state spaces. For example, 
length of service, time in grade and skill group are three 
reasonable choices of the second characteristic in military 
Manpower planning models. Only one of these can be chosen, 
so it is useful to have some guidance regarding how one might 
combine states across the characteristics that are not chosen 
for inclusion in the two-characteristic model. In Section A 
we investigate the condition under which the states in each 
grade of a two-characteristic model are "lumpable" as defined 
by Kemeny and Snell (1960). In Section B we discuss how one 
might combine states when the conditions for lumpability are 
not satisfied. 

In Chapter VII the problem of retraining people between 
various skill groups is considered. Accordingly the second 
characteristic of our model is taken to be the individual's 
skill group. The retraining problem is formulated under a 
condition of equilibrium stocks and flows; attrition rates 
must be specified. The decision variables for the planner 
are stocks, promotion rates and total retraining costs. Of 
course these variables are interdependent. We treat stocks 
as the independent variables and show the relation of promotion 
rates to stocks when external flows are specified. Two methods 


for varying the stocks to achieve desired promotion rates are 


developed. Equations to compute the availabilities and 
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requirements for retrained people are shown. The total cost 
of retraining is modeled using the classical transportation 
problem. The foregoing availabilities and requirements are 
treated as the supplies and demands in the problem; the unit 
costs of retraining people are treated as the transportation 
costs. The relation of the tctal cost of retraining to the 
stocks (and promotion rates) is developed. We assume that 
the planner wants to minimize the total cost of retraining, 
however we do not assume that he can explicitly describe the 
constraints on stocks and promotion rates that must be 
observed in minimizing costs. We develop a technique in 
which the planner can vary the stocks in two successively 
indexed grades so as to reduce total retraining costs, while 
holding the stocks and promotion rates in all other grades 
unchanged. This technique 1S quite practical because the 
planner needs only to be able to recognize combinations of 
stocks and promotion rates that are acceptable rather than 


having to formally specify a set of constraints. The practi- 


cality of this technique is further enhanced by our demonstra- 


tion that when hiring is restricted to the lowest pay grade 
the total retraining cost is a convex function of the numbers 
of people hired and promoted each period. Thus if the 
collection of acceptable combinations of stocks and promotion 
rates is a convex set any locally optimal solution to mini- 


mizing total retraining costs is globally optimal. 
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II. PROBABILISTIC PROPERTIES OF THE TWO-CHARACTERISTIC MODEL 


In this chapter we develop various probabilistic proper- 
ties of the two-characteristic model. One of our purposes 
here is to present a unified, computationally tractable 
approach to deriving probabilistic information from a two- 
characteristic model. 

Let Q be the transition matrix for the transient states 
of a two-characteristic model. (The reader should see 
Chapter 3 of Kemeny and Snell (1960) for a general treatment 
of transient states in finite Markov chains.) For a typical 
military manpower planning application Q has dimensions at 
least 100 x 100. If the model were applied to the U.S. Navy 
enlisted force with nine pay grades as the first characteristic 
and 88 skill groups as the second characteristic, then Q would 
have dimensions 792 x 792. Military applications with even 
larger state spaces are not uncommon. 

Following Kemeny and Snell (1960) we define the "funda- 
mental matrix," denoted N, to be a. Many of the proba- 
bilistic properties of interest can be obtained from a 
knowledge of N. With large state spaces the computation of N 
is non-trivial. By imposing reasonable restrictions on the 
transitions between grades (no demotions and no multi-grade 
promotions) we are able to define a fundamental matrix N, for 
each građe i, determine many probabilistic properties of the 
overall system from these comparatively small matrices, and 


show how these matrices are combined to form the fundamental 
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matrix N. In military manpower planning applications, the 
number of states in any grade is typically one fifth to one 
tenth of the total number of states in the system, so 
computing the fundamental matrix for a grade is signifi- 
cantly easier than computing the fundamental matrix for the 
entire system. Usually only limited portions of the fundamental 
matrix for the whole system are needed, and these may be 
readily calculated using the techniques developed here based 
on the fundamental matrices of the individual grades. 

Certain matrix and vector notation used in this and 


subsequent chapters is summarized in Appendix A. 


A. DEFINITIONS AND ASSUMPTIONS 

Each state in the system is identified by a couple (i,j); 
a person is in state (i,j) if he is in grade i and has second 
characteristic j. There are n consecutively numbered grades: 
i=1,+»+»,n. Each grade i has a set of values of the second 
characteristic, denoted J(i), so the state (i,j) is defined 
for jeJ(i) but not otherwise. For example if the second 
characteristic is length of service (LOS), and a person in 
grade l can be in this grade only when his LOS is between 1 
aad 3, then J(1) = {1,2,3}. 

We assume that the second characteristic takes on consec- 
ie integer values in each grade, so we define, 

1(i) = smallest value of the second characteristic 


associated with grade 1, 


u (1) largest value of the second characteristic 


associated with grade i. 
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Thus, 
Molea h = lieli). 


Let W: be the number of elements in J(i). Then, 


Wr u(i) - 1(1) +1. 


Denote the set of states associated with grade i by T;,. Then, 
= Mi Ela), 1=1,:- + ,n. 


Note that T; contains wW: elements. 
n 
Let T= U Tis the complete set of all transient states, 
i=l 


and let To denote the single absorbing state "out of the 
system." 
Let, 


1) O; be a Wi XW; matrix of one-step transition proba- 
bilities q; (jm), },meJ (i), where q; (jm) is the 
BI ty of a transition from state (1,3)eT, to 
state (i,m)eT;. 


2) P. bea W, X W,,, Matrix of one-step transition proba- 


1 +] 


bilities Pp; (jm), jeJ (i), meJ(i+l), where p; (jm) is 
the probability of a transition from state (1,j)eT, 
to state (itl,m)eT,,,. 
3) A; be a w, x 1 matrix of one-step transition proba- 
bilities a; (j); jeJ (i), where a; (j) is the probability 


of a transition from state (i,j) to "out of the system," 


foc. to To: 
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The basic assumption in this thesis is 
AO: Movement between states of the system follows the 
stochastic laws of a stationary finite state Markov 
chain. 
The following restrictions on movements between grades are 
assumed: 
Al: From any state in T; it is possible to make a one- 


F] or To: 


step transition only to states in Ts, T 

The practical significance of this assumption is that no 
person is promoted more than one građe in a single period, 
and no one is ever demoted. We have chosen for the sake of 
definiteness to call the first characteristic "grade." Any 
characteristic that satisfies Al can be used as a first 
characteristic. Length of service is an example of a 
characteristic that satisfies Al, and we take advantage of 
this in Section E of Chapter IV. 

We also assume: 

A2: Each matrix Qs, i=l,°°’,n, 18 a transient matrix. The 
The practical significance of this assumption is that no one 
can stayina grade forever. With Al this means that everyone 
entering the system must (with probability 1) eventually 
leave the system. 

Under AO through A2 the one step stochastic transition 


matrix for the entire system, denoted P, is 
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| 
7 1 | T 
Qn Po | A, 
Q ` | A 
Pp = Bun } 3 . 
N . 
N 1 | An-l 
So, a 
E a A 


(Recall from Appendix A that 0 is a vector of zeroes.) 


The transient matrix Q for the transient states T is then, 


N Pa-l 
ïX 
Q 


n 


The plan of this chapter is to develop the probabilistic 
properties of: 
1) any set of states T,, 


2) any union of consecutively indexed sets Tir 


m 
A Tis 
i=k 


3) the union of all transient states, T. 
One of the purposes of this development is to show that the 
stochastic properties of Q, typically a large matrix, are 


readily calculated in terms of the smaller matrices O; and P.. 
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The results of this chapter follow closely those in 
Chapter 3 in Kemeny and Snell (1960). The notation 
(K&S,3._. _) indicates that a result can be derived from 


theorem 3. _._ in Kemeny and Snell. 


B. FIRST-ORDER PROPERTIES 

The term "first-order properties" is used here as a general 
term for various probabilities and first moments, e.g., the 
probability of visiting a state, the expected number of visits 
to a state and the expected time spent in a grade. 

Under A2 the overall transient matrix Q has a fundamental 
matrix N = (I-90) 71, and each element of N is the expected 
number of visits to the column state starting from the row 


state (K&S,3.2.4). Under Al, O has the structure shown in 


(1). By direct methods it can be shown that 


n-1 
Ny NPN, N} PN P N3 c 8 ah on 
n-1 
ao ees eee 
N. = = (2) 
n-1 
N, T J (N.P.)N, 
NS 1=3 
= | 
SL | 
N 
n 
where N,=(1-0,)°°, ji=1l,’*-,n, the fundamental matrix for grade 


i. Note that the large matrix N is completely determined by 


the matrices N, and P;. Thus the only matrix inversions 


1 


required to find (I-Q) ~ are the inversions of A 
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foie? ,n. This is of considerable computational significance 
because as previously mentioned, O is usually a large matrix. 
Each matrix N., has a probabilistic interpretation. 
We pursue this interpretation and show that these matrices 
can be used to determine other probabilistic properties of 
interest. 
In this and subsequent sections we make numerous defini- 


tions, the geh 


definition is denoted by Dk. 

Let us consider first the properties associated with a 
single set of states T: and define: 

Dl. v; (j.m) = expected number of visits to state (i,m) 

given that grade i is entered in state (i,j) 

D2. V; = a W; X W4 matrix having v; (j.m) as the element 

in row JjJ-1l1(i)+1 and column m-1(i)+l. 

From (2) the element of N, in row j-1(i)+1 and column 
m-l(i)+l equals the expected number of visits to state (1,m) 
given that grade i is entered in state (i,)). RES. 3.2.4). 
So, from definitions Dl and D2, we have, 

UNNE (3) 
Note that the rows and columns of N; and ve correspond to 
states in grade i in the same way as the rows and columns 
of Q.. 

Now define: 

D3. no» = expected time in grade 1 given that grade 1 


is entered in state (i,j) 


D4. Ta = [1 (1(1)),***+ T, w(i))], a W; X l vector. 


The expected time spent in građe i equals the sum of the 
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expected number of visits to the varicus states in grade i. 


From (3) and D3, 


t; (j) = component (j-1(i)+1) of N.I, 


and from D4, 


E CNI a w; x 1 vector (4) 


(Recall from Appendix A that 1 is a vector with all compo- 
nents equal to one.) 

We next turn our attention to where the process goes 
when it leaves grade i. From assumption Al (ne demotions, 
no multi-grade promotions) the process upon leaving Ti must 


enter either Tad or To: Next define: 


BS b.(j,m) = probability of entering grade itl in state 
(itl,m) given that građe i is entered in 
state (i,)) 


bo. 3S. = aw. x w. 


i i itl matrix having b. (J,m) as the element 


in row j-l(i)+l and column m-1(1+1)+1 
DI. b; (J) = probability of ever entering Titl given that 


grade i is entered in state (i,3) 


D8. b; = [b; (1(1)),+**,b,(u(i))], aw; x 1 vector 

DI. D9 (3) = probability of never entering Ti+] given 
that grade i is entered in state (i,j) 

DIO: Dio = [b¿y(1(1)),***,b¡9(u(i))], a w; X l vector. 


From these definitions it follows that. 


B; = N,P,, aw,'x w; matrix (K&S,3.5.4), (5) 


b. = B.1, aw, x 1 vector, 
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DoT t= b; 


N.A., aw. X 1 vector. 
i i i 


The matrix B. is particularly useful in our analyses. 
For example, let f; be al x Wi vector of the number of 


, : gs 
people enterirg T.. Then fB; is al x W;¿y vector of the 


number of these people who will eventually enter Tsay: 
(K&S,3.3.6). This vector is used repeatedly in Chapter III. 
Next we consider the first-order properties related to 
grades i and k where i<k. Define: | 
Dll. b((i,3),(k,m)) = probability of entering grade k in 
state (k,m) given that grade i is 
entered in state (i,)j) 
12, Bay = a W; X W matrix having b((i,j3),(k,m)} as 
the element in row j-l(i)+tl and column m-1(k)+1. 
From definitions D5 and Dll and a simple conditioning argument 
we have 
u(i) 
b((i,j),(it2,m)) = | bs (3-4) bs, (rm). 
r=1 (1) 
So, from D12, 


a 


Notice from Dll that Bas is an identity matrix and from D5 


that B. .,,=B.. More generally it can be shown that for i<k, 
tl] = 
rl 
Bix = ae a w; X W, matrix. 
r=i 
Define: 
D13. v((i,3),(k,m)) = expected number of visits to state 
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(k,m) given that grade i is entered 


in state (1,3). 


Dard. Vip = aw; X Wy matrix having v((i,j),(k,m)) as the 
element in row j-l(i)+l 
D15. b,,„(j) = probability of ever entering grade k, given 
that grade 1 is entered in state (i,j) 
DIG. Dix = [D¿, (1(3)),+**,/bjy(u(1))], aw, x 1 vector. 


Considering each row of Be as the part of an initial proba- 


k 
bility vector that applies to The we then have, 


Vik = B; p Nys aw; X Wy, matrix (K&S,3.5.4), (6) 


and, 


Dix = Bil? a we x l] vector. 


Define: 
D17. Ta (J) = expected time in grade k given that grade iis 
entered in state (i,j) 


D18. DT [Tip (1 (d)) cc, T¿y(u(i))], a W; X 1 vector. 


The expected time in a grade is the sum of the expected number 
of visits to states in that grade, so 
Tik = Viplr a Wi x l vector. 

This completes our study of the first-order properties 
related to the varicus grades of the system. The foregoing 
definitions by no means exhaust the first-order properties of 
the two-characteristic model that might conceivably be of 
interest. It is felt, however, that these properties will often 


be of practical interest and that other first-order properties 


may be readily derived from these. 
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c. TWO SPECIAL CASES 

The elements of the fundamental matrix for grade if, N., 
have a somewhat different interpretation when the states in 
grade i have what we call the "0-1 visiting property." We 
say that a state has the 0-1 visiting property if the state 
can be visited nomore than one time. Important examples of 
two-characteristic models in which all transient states have 
the 0-1 visiting property are the models in which the second 
characteristic is either length of service (see Chapter IV) or 
time in grade (see Chapter V). 

If each state in T; has the 0-1 visiting property, then 
the expected number of visits to a state in Ti is equal to the 
probability of visiting the state. The element of N, in row 
3-1(i1)+1 and column m-1(i)+1 may then be interpreted as the 
probability of visiting state (i,m) given that grade i is 
entered in state (i,j). 

Another property of interest is the "no return property." 
we say that a state has the no return property if it 
is impossible to ever make a transition into the state after 
a transition has been made out of the state. The 0-1 visiting 
property implies the no return property, but they are not 
equivalent. For example, in modeling manpower flows in the 
U.S. Civil Service one might use "pay step" as a second 
characteristic. Each state is then a couple (grade, pay step). 
A person can stay in the same pay step for more than one 
period, so if there are no demotions then each state would 


have the no return property but not the 0-1 visiting property. 
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If the states in T; have the no return property then it 
is possible to order the states in T; so that O; is upper 
triangular. When O; is upper triangular so is 1-9; and the 
computation of the inverse of I-Q., i.e., the fundamental | 
matrix for grade i, Ni, is considerably easier than in the 
general case. 

If the states in T, have the 0-1 visiting property, then 


not only is N; upper triangular but also the elements of N; 


on the main diagonal are all ones. 


D. VARIANCES 
The format in this section follows closely that of Section 
B, but here we are concerned with various second moment 


properties of the two-characteristic model. 


Define: 
DES. Vane = variance of the number of visits to 
state (i,m) given that grade i is entered 
in state (i,j) 
D20. De = aw; XW; matrix having Və į (Jm) as the element 


inro J) Orl and colunn m=1(i)tl: 
Following (K&S,3.3.3), 


i)sq’ 


(See Appendix A definitions of "dg" and "sq.") 


a = DAN de ~ I) - (N 


Define: 
D2 L. T, 40) = variance of the time spent in grade i given 
that grade i is entered in state (i,j) 
D22. >; = It, gL) re ty sud, aw, x l vector. 
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Following (K&S,3.3.5), 


T . = . = . -= ‘ 
2,1 en; DE; g 
Define: 
D23. v„((i,j), (k,m)) = varıance of the number of visits to 


state (k,m) given that grade i is 


entered in state (i,j) 


D24. V„(i,k) aw, X Wy matrix having V>((i,3),(k,m)) as 
the element in row j-1(i)+1 and column 
m-1(k)+1. 
Following (K&S,3.3.6), 
V5 (i,k) = Vik RP a, - I) - Ve Sao 
Define: 
D26. T5((1,5),k) = variance of time spent in grade k given 
that grade i is entered in state (i,j) 


Bye) = (t,(,10)),k),---, live 


a W: x ] vector. 


Following (K&S,3.3.6), 


T,(i,k) = B,„(2N, - I) Ty = E 

If each state in T; has the 0-1 visiting property, then the 
diagonal elements of N, are equal to one, and, 

De = 1, 


a Isa! 


DE) = Vip Vik sa" 
E. MATRICES OF t-STEP TRANSITION PROBABILITIES 


In this section we consider the probability of being in 


state (k,m) t steps after being in state (i,j). The 
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matrices of these probabilities are called the t-step transi- 
tion matrices. They are used in Chapter III to represent the 
stock vectors as a sum of steady-state and transient components. 
In Chapter IV the t-step transition matrices are shown to 
provide a description of how "cohorts" flow through the system. 
Define: 
D28. m(t:(1i,3),(k,m)) = probability of being in state (k,m) 
t steps after being in state (i,j), 
tU, + 
D29. M., (t) = a W XW matrix having m(t: (i,j), (k,m)) 
as the element in row j-l(i)+l and 
column m-1(k)+1. 
The rows of M.„(t) are associated with states in T;; the 
columns of Mey (t) are associated with states in Ty. 
We have immediately that 
M.; (0) = I. 
From assumption Al (no demotions, no multi-grade promotions) 


we have, 


oli 


M.,(t) A 


wit Sn. 


| 
ON 


M., (t) 
(Recall from Appendix A that 0 denotes a matrix of zeroes.) 

If the process is to be in state (k,m) exactly t steps 
Mieerebeing in state (i,j), then it must be in some state in 
grades k or k-l exactly t-l steps after being in state (i,j). 


Conditioning on this fact leads to the recursive equation, 


= == — 3 = eee F 
Mo (t) = Mi, (t-1)0; + Mj _j(t-DP;_y, t= 21,2, (7) 


K 
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For any i and k the sum over t of the probability 
matrices Mp (t) gives the matrix of the -expected number of 
visits to states in građe k starting from states in grade i. 


So we have, 


ne 


| 
e 


i<k, (8) 
= 0, otherwise 


By assumption A2 the O: matrices are transient, so Vi 


k is a 
matrix of finite elements. This implies that, 
lim M. (t) = 0. (9) 


t> 


From (7) it can be shown by an inductive argument that 
oo I ea i (10) 


The t-step transition matrices provide a rather compre- 
hensive picture of how people move through a two-characteristic 


System. 


ES CONDITIONING ON PROMOTION 

In manpower planning one is often interested in conditional 
probabilities, e.g., the probability of attaining grade k 
given that grade i is attained. The stochastic properties 
of the transient matrix Q under conditioning on promotion are 
briefly developed in this section. 

Define: 


D30. (1,3:t) = the event "in state (i,j) ac Cine t” 


O31. ni =< {ibe ovont a transition is made into T, before 


leaving the system." 
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Conditioning on the event Te is the same as conditioning on 
promotion to grade k. 


Define: 


D32. gq. (j,m) = Pr[(i,mztt1] (i,j7t)) 


Prl(i,m;t+1) |(i,j;t), T 


4 WS cee * 
D32. q, (3,m) 141] 


Provided that Pe en # 0, we have 





*,. . $ ` * 
qj(3/m) = Pr[(i,m;t+1)|(1,35t), T;,,] 
= Pr[(i,m;t+1)|(i,j;t)] 
Xx ; 2 : 
Pr[T;,„|lä,j;t), (i,m;t+l)] 
x : : 
Pr[T,,,|(i,j;t)] 
Ef ewe ede 
= q,(j,m) x Se 
Sn ea) 
| b, (m) 
= q; (Jm) X Pap (11) 
Define: 
D34. C; = aw; XW, matrix having the elements of b. 


(see D8) on its main diagonal and Zeroes 
elsewhere. 

We will assume that promotion to grade i+l is possible 
from every state in T; Under this assumption En exists. 
If promotion to grade i+l is impossible from some state 
(1,3) then we must avoid conditioning on an impossible event. 
This is readily accomplished by temporarily treating state 


(i,j) as part of T, (out of the system) and redefining 36), 


0 


O. P: and A; accordingly. 


r 
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Define: 
D335. o; =aw xwj matrix having qj (jm) as the element 
= in row j-1(i)+1 and column m-l(i)+l. 


Then from (11) and D34, 


The matrix or 1s the matrix of within grade one-step transition 
probabilities conditioned on the attainment of grade i+l. 
Define: 


Pr [ (i+1,m;t+1) | (1,3;t)] 


D36. I (j,m) 


x : a x 
D37. P; (jm) = Pr[(itl,m;tt+l) | (i,j;t), Ts 44! 
x . E e . 
D38. BP. =a Wi X Wi] matrix having PERC m) as the 


element in row j-l(i)+l and column m-1(i+1)+1. 
We then nave, 
Oe u en en + 
pP, (j,m) = ar (mtt) | (i,j; t), Tia] 
= Pr[(i+tl,m;t+1) |(i,j;t)] 
* >. A S 
E), (+1 ,m; ++1)] 


pl A et) 





: Al 


Thus from D34 and D38, 


The matrix P: is the matrix of one-step promotion probabili- 
ties conditioned on the attainment of grade itl. 
* r _ ler 
Because (0,) = e; O; Ci 
The fundamental matrix for grade i when we condition on 


promotion to grade i+l is 
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ae T * -1 
a * 
=" J (Q,)” 
r=0 
T 1 
= r 
meal Ci Qi Ci 
r=0 
= c,4y.c.. 
1 ees 
Define: 
D39. v:(j,m) = expected number of visits te state (i,m) 
given that grade i is entered in state 
(1,3) and grade i+l is attained. 
D40. vi NS matrix having vi (3,m) as the element 


in row 3-1(1)+1 and column m-1(1)+1 
D41. bi (3,m) = probability of entering grade (i+1) in 
state (i+1,m) given that grade i is 
entered in state (1,3) and grade itl is 
attained. 


D42. B. =aw: XW 


, kyo 
i i en matrıx havıng b; (jm) as the 


element in row 3-1(1)+1 and column m-1 (1+1)+1. 


Then one may show that, 


xo * 
ve = N; r 
and, 
B* = n* p* 
1 i iL 
5 -1 


; xE 
Note that Bs 1s simply B- with its rows normalized; Q; is not 


simply a row normalized form of Q.. 
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, ko o 
As with the matrices Bas products of matrices B; with 
successive indices are well defined; their meaning is that 


of a matrix B as defined in Dll and D12 with conditioning 


ik 
on attainment of grade k. 

The conditioned and unconditioned matrices may be used 
together. For example, the elements of Bee give the 
probabilities of entering grade 1+2 in the column state 


conditioned on starting from the row state in T and even- 


tually attaining grade i+l. 
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III. EQUATIONS OF STOCKS AND FLOWS 


We begin by defining ener ean "stocks" and "flows" and 
then discuss why stocks and flows are important in manpower 
planning models. Next the relations between stocks and 
flows in a two-characteristic model are developed. Finally 
we show how the stocks can be represented as the sum of a 


"steady-state" component and a "transient" component. 


A. DEFINITIONS AND BACKGROUND 
The following three definitions constitute a "timing 
convention"; it is used in this and following chapters. 

(1) A period is the interval of time from immediately after 
an integer value of the time parameter t up to and including 
the next integer value of t. A period is identified by the 
value of the time parameter at the end of the period. Thus, 
A IA 


where ty is an integer. 


period ta 

(2) The number of people in a state at the end of a period 
is referred to as the "stock" in that state. Thus, stocks are 
counted only at integer values of the time parameter, t. 

(3) The number of people who change their status in the 
system from one state to another during any period is referred 
to as a "flow." Flows occur during a period, but we do not 
specify the exact time at which they occur. 

Stocks and flows are of primary importance in most man- 


power planning models. The most obvious reason for this is 
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that costs are closely related to stocks and flows, e.g., 

total payroll depends on stocks, transportation costs or 
retraining costs depend on flows. Recruiting policy and 
promotion policy depend in the short term on present stocks 

and in the long term on how we model future stocks and flows. 
Determining the feasibility of a retirement plan and evaluating 
the effects of a change in billet structure are other instances 
in which the planner needs to be able to nodi stocks and 

flows in a manpower system. 

It should be noted that the Markov-type model is not the 
only method that one might use to model stocks and flows. A 
different method is the "cohort model.” Marshall (1973) 
presents a comparison of the Markov-type and the cohort 
models. Another method for modeling stocks and flows is the 
“chain model" presented in Grinold and Marshall (to be pub- 
lished). Meeting of various manpower flow models in U.S. 
Navy (1973) indicates that the Markov-type model is by far 
the most commonly used method in military applications. 

We now define the variables that are used to model the 
stocks and flows in the two-characteristic model. Recall that 
T; is the set of states associated with grade 1, Ws is the 
number of states in Tis and for convenience of notation we 
assume the second characteristic takes on successive integer 
values in grade i. 

In a Markov model the stocks and flows are in general 
random variables. In this thesis we deal only with the 
expected values of stocks and flows. Such a model ae called 


a "fractional flow model" because the transition probabilities 
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Of the Markov model are in effect treated as fractions which 


direct flows through the system in a deterministic manner. 


Let, 
Sjj (t) = expected stocks in state (i,j) at time t, 
Pi (t) SÓ ee Ss uli) eye, 


alx Wi vector of expected stocks in Ti; 


s(t) = (s} (t), =, s (t)), 
n 
alx }_ wį vector of expected stocks in 
i=l 
the system. 
By assumption Al, flows into any state in Ti must come 


from a state in either T; or We will also make 


Ma 
provision in our model for "external flows." The source 

of such flows is unspecified. However, we may consider 
external flows as consisting of people hired into the system. 


The external flows may be deterministic or random, but we 


deal only with their expected values. 


Let, 
at) = expected flow from states in T; to state 
(7) adung period t, a Scalar: 
d; (t) = Sa OS Sa, on alx Wi vector; 
EGS?) = expected external flow into state (i,j) during 
period t, a scalar; 
f; (t) = ia Ar DI a l x w;, vector; 
955%) = expected flow from states in TiL] to state 


Gj) during period t, a scalar; 
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g; (t) = AN sn Ga) ee alx Ws wector. 
When i= 1, 955 °°) is defined to be zero. 
The relation between the flow vectors and the stock vector 
in grade i is depicted in Figure 1 where "TFt" denotes the 
states in građe i at time t. 


t-1 f; (t) 
g; (t) 


ln m; t S(t) |p., 


: E 
1 J i’ 


Figure 1. Stocks and Flow in Grade i in Period t. 


B. BASIC STOCK EQUATION 
By assumption Al and the interpretation of external flows: 


emGee= ca (cy + £,(t) + g, (t)- 


See Figure l. 
It will be convenient to define, 


sy (t) = 0, a vector of zeroes, 


Po = 0, a matrix of zeroes. 


Using conditional expectation we then have 


a; (t) S,(t-1)Q,, Igel, (en, 


g; (t) S; _](t71)P;_ 


os 
The basic stock equation is then, 
s; (t) = s; (t-1)Q; + f; (t) + S,_,(t-l)P,_), 1. < icn ni 
The basic stock equation for grade i can be written in 


terms of the expected or actual stocks in grade i in previous 
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periods. By recursively applying the basic stock equation 


or s,(t), S¿ (tl), s; (1) one obtains 


el Be a 
s; (t) = s,(0)0, + 2 (t-r)0," + Sind (EDO, 
Comer 2,8 ey (2) 
a 


which we will refer to as the cumulative stock equation. 
Equations (1) and (2) are used Bene in the remain- 

der of this report. Some manpower models used in the U.S. 

military for short-range forecasting consist principally of 


an application of an equation similar to (1). 


See TRANSIENT PROPERTIES OF THE STOCKS 

In this section we develop a method for expressing the 
stock vector as a sum of a "steady-state" component and a 
"transient" component. This method helps one to understand 
how the stock vectors will change in going from any present 
stock vector to future stock vectors. This method also helps 
one interpret the character of the limiting stock vector. 

We do not want to restrict ourselves to cases in which 
the stock vector converges (as t increases) to a finite 
vector, so it is best to specify what 1S meant by a "steady- 
state" component of the stock vector. We say that the vector 
function S, (t) is the steady-state comporent of the stock 
vector S. (t) if; 


lim (s; (t) - S;(t)) = 0. 


L>0 
For any sequence of stock vectors Seale) there is more than 


one choice of the steady-state component. In applications 
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one would prefer a steady state component having a relatively 
simple mathematical form. We show that in some cases a 
judicious choice of s, (0) makes this possible. The following 
theorem shows the properties of a class of steady-state 
components which can be quite useful. 

Theorem. For any collection of 1x Wi vectors s, (0), 
‚’’*,n, let the vector functions Ss; (t) satisfy 


rn nn 
fps (e-1)0, + f(E) ETS. y (E=1)P, 1 t=1,2,+ 0%, 


1’ 

i=1,-+--+--,n. 
i.e., the vector functions S; (t) satisfy the basic stock 
equation (1). Then. 


(a) the actual stocks at time t are 


N, i N 
s,(t) = s,(t) + ey $B (0) - 54 (0) Mg (ED, 


= Lv i av) 
(b) sites, = yey (8k (0) “Sy (OV BENG + 


alx Wi vector having finite 
components, 
te) S; (t) is a steady-state eonek of the stock 
vector s; (t), Liar 


lim (s, (t)-S, (t)) = 0. 


t>o 
Before proving the theorem it will be worthwhile explaining 

why one might be interested in such a theorem. Part (c) of the 

theorem says that s; (t) is a steady-state component of the 

stock vector s; (t), and part (a) shows how the stock vector 

S; (t) can be expressed as the sum of a steady-state component 


a eransient component. Part (b) of the theorem says that 
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the total over all periods of the difference between the stock 
vector and its steady-state component is a readily calculated 
finite vector. 

Such information can be useful when long-range planning 
has been done using an “equilibrium model." As an example a 
aer an Organization which intends to change from its 
present size of 250,000 to a size of 200,000. The manpower 
planner may use an equilibrium model to develop policies 
that are in some sense optimal, and these policies will main- 
tain the size of the organization at 200,000 people once it 
has been reduced to this size. So the equilibrium model tells 
the planner what to do once the size of the organization 
reaches the desired equilibrium level but it doesn't tell him 
how to change the size of the organization from its present 
level (250,000) to the desired equilibrium level (200,000). 
This problem of finding an optimal transition policy to go 
from present stock levels to a future equilibrium stock 
distribution is a difficult one. One method for making the 
transition is to immediately implement the hiring, promotion 
and attrition policies that have been derived from the 
equilibrium model. Because of the transient nature of the 
system (see assumption A2) these policies will eventually 
bring the stocks in the system to their equilibrium levels. 

In the theorem the vector functions =) play the role 
of what the stocks would be at time t if the system were in 
equilibrium. The stock vectors s; (t) indicate what the 
stocks will be at time t if we start with the present stocks 


s; (0) and implement the policies of the equilibrium model 
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(which are reflected in the external flows, fi (t), and the 


transition matrices 9: P. 


i and Aj). From part (a) of the 


theorem we may readily calculate the difference between 
actual stocks and equilibrium stocks in any grade and any 
period. If there is a penalty associated with having more 
people than the equilibrium stocks in the system, then part 
(b) of the theorem may be used to calculate the total penalty. 
Part (c) of the theorem assures the planner that the differ- 
ence between the actual and equilibrium stocks does converge 
to a zero vector as the time parameter t increases. 

The proof of the theorem follows. 
Proof. By hypothesis the vector functions S, (t) satisfy the 
basic stock equation (l), so they must aiso satisfy the 
cumulative stock equation (2): 

e t-1 


_ 4% t _ r von Kar: 
ES s,.(0)0, + ns Do it Be) Pee 


Ne 


E 


Of course the stock vectors s; (t) also satisfy the cumulative 


stock equation (2), so we have, 
AV) 7 = N t 
S; (t)-s, (t) = (s; (0) s,(0))0, 
t-1 > / E 
+ IS 1 (DDD P 194 : 


When i=1 this implies, 


II 


mn mn t 
sy (t) S} (t) t (s, (0)=s, (0))0, 


N 


nm Pm 


so we have shown that part (a) of the theorem is true when 
i=l. Suppose part (a) of the theorem is true for grade i-l, 


1.€., 
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a 1-1 y 
es, „ie + py (8k (0) “8 (0) IM gan (ED 


Then 
u Lat As 
S;_,(t-r-1)-s,;_, (t-r-1) = pay (8k (0) 84 (0) IM ae 


and, 


nm E _% t 


y 7 N y 
+ (s, (0)-S, (0))M_ . . (t-r-1)P. .O. 
E Ki i-1% 
2 A t 

i-1 ES t-1 a 


sro) = (0) ) Mo ¡oy (t-r-l)P,_10, - 
k=1 r=0 ”” 


Prom Equation (10) in Section E of Chapter II, 
El N 
l Me i-l (trI Pi Q = Malt), 
r=0 
so we have shown by induction that, 
t)-S. (t) = (s,(0)-=S, (0))0,+ + 7 (0)=S, (0))M, . (t 
This proves part (a) of the theorem. 


From part (a), 


li 


nm 1 mv 0 
s; (t)-s; (t) meet ))M, 5 (t) > 


09 


= N 
J (s.(t)-s. (t)) Y Y (s,(0)-8,(0))M,.(t) 
Oo -7 i E=0 k=1 5 6. 5 


} s y 
(s, (0)-s, (0) Mi CE) 
k=1  * oe 


1 N, 
we is, l0)-5,(0))B,,N; : 


li 


a lx Ww, vector having finite components. 
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The last step above follows from equations (6) and (8) of 
Chapter II. This proves part (b) of the theorem. 

Part (c) follows from the fact that the sum in part (b) 
is finite. N 

The utility of this approach depends on our ability to 
find vectors s, (0) such that the vector functions s, (t) are 
simple and readily calculated. The following subsections 
present examples. 

J. Fixed External Flows 

The equilibrium models previously mentioned enjoy some 

popularity in military manpower planning in the United States. 
The rationale underlying the use of such models is that one 
should determine the organization structure and the policies 
to maintain this structure which are optimal. Among the 
policies derived from an equilibrium model is the hiring 
policy. This has the form, 


f. (t) = f., 


jet = 12,00% = 1,°*-,n, 


where the vector of the number of people to be hired into the 
states in grade i each period, Lis is specified from the 
equilibrium model. 
Define, 
N 
Then using (1) it is easy to show that 
N 
s} (t) = fN} for all t. 


Thus, from the theorem 


s,(t) = $,(t) + (s, (0)-8, (0))M, y (t) 


t 
£,Nz + (s (0)-£,N,)Q, E 
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Now recursively define, 


mv mv 
Sy = 5, (t) = £, 17 

N nm i 

s, = (£; + S;_1P3_1)N, » LESA, IN (3) 


It is straightforward to verify that these S, satisfy the 
basic stock equation (1), so we have from the theorem, when 


E = Eo 


N, 1 A 
Sage) = sa, + L (S(O -Sp Mpi (E 


The steady-state component can also be written, 


) BL ¿Ns AL A (4) 


a 
Note that y f,B,. is a nonnegative 1 x w, vector, so the 
Ken k ki 1 


limiting vector of stocks in grade i must be a nonnegative 
combination of the rows of N.. In general, then, not all 


nonnegative 1 x w. vectors are possible limiting stock 


a 
vectors under constant external flows. 
2. Linear Growth of External Flows 

In this section we consider the case in which the 
number of people hired into each state increases by the same 
amount each period. Such a hiring policy may not be natural 
over a long period of time, but it may provide a simple 
approximation to planned hiring policies. 

Let the 1x Wi vector f. be the amount that the number 
hired into states in grade i increases each period. Then 


the external flow vector for grade i is, 


f. (t) = tf., A > d, een. 





Let, 
NL 
AI 519187 > 
Let the vector function S} (t) satisfy the basic stock 


equation (1), z 
Ay av 
S} (t) = S, (t-1)Q, + f,(t). 


Using the identity N.Q,+I=N one can show that 


1 
Emit) = GEN N.O.N 
S,(t) = tf Ny 12153 - 


Thus from the theorem, 


s(t) = tf_N 


17 fN QN} T (s, (0) + E 


1N,9,N,?9, 
We note that s(t) is of the form 


A 
S} (t) = tL, -1 Cy 


waere L. = Í. N 


is 2e xw. vector 
1 1 = i í 


and Cy = -£,N,O,N, is a lx Wi vector. 


Consider some grade ie{2,---,n}. Suppose that 


S E) = tı + 


where L. and C. are 1 x w. vectors. 
i-l ı-1 1-1 


Using the identify 


E a = 
(t£,N, 1 ,N,0,N;)0; + (t+1)f; ((t+1)£, N, -£,N,Q.N;), 


one may show that if 
hs 
S; (t) = tf. iN; (7E; N QN; +S, y (E72) PNG L541 Py ¡NjO¡N; » 
then s; (t) satisfies the basic stock equation (l). Note 
mn 
S 


i(t) has the forn, 
~ u hr (5) 
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where, 


eee Sees 
and 


pee ee) Ge ee mene gee 
Thus we have shown that when the external flows grow linearly 


the steady-state component of the stocks also grows linearly. 


By recursive substitution in (6) we have, 


E 
l 


L. = 
l 


l ma P- 


B N . o. 
k kkıı 
Note that this vector gives the expected number of visits 


to states in grade i of f =f, (ttl) it) entrants in grade 


k 
k, k=1,***,1. That is, the growth in the stocks in grade i 
each period, Le, equals the expected number of visits to 
grade i of the growth in the external flows each period in 
the grades less than or equal to 1. 

Both Li and C; have the fundamental matrix N, as a right 
Factor, so the steady state component of the stock vector, 
S, (t), must be a nonnegative combination of the rows of N;- 
This same resuit was observed in the case of constant 
external flows. 

In summary we have shown that by choosing 

EGO) = tb, + C, 


where L; = £f_N. when i=l, 


Tl 


=e + bs -1Pi-1]?2Nj + i=2,-"*,n. 
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and C; =£,N,0,2N, when i=1, 


-((L,_] z C;_ı)P;_ı t E.N.Q:)N; r VEZ, oe, 
then from the theorem the stock equation may be written 
N i A 
s; (t) = s; (t) + eg Ok (0) “Sy (0) M5 ED E 
3. Geometric Growth of External Flows 
In this subsection we will show that geometric growth 
of external flows leads (eventually) to geometric growth of 
the stocks. Geometric growth is a not uncommon phenomenon 
both in natural and man-made systems. Geometric growth is 
frequently a reasonable assumption for medium- to long-range 
planning in manpower systems. 
We consider the case in which the external flows 
ento the states in grade i are proportional to,a,known vector 


fi and grow geometrically at a rate ð.. Thus, 


a t = I 
fi (t) = OF Lis t u 172, t 
1i = Ie ein 
OO 
a 


When 0 SL the external flows contract rather than grow. 
If 8, is not an eigenvalue of 0; for k<i<n we may 


define, 
1 -1 
k 
If the states in grade i have the 0-1 visiting property then 


all eigenvalues of O: are zero and thus 9, > 0 is never equal 


to an eigenvalue of O; in this case. 
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The following identity will be useful: 
N.(6,) = ) (9) . 
1 K r=0 k 7 
From this it follows that 
N. (89,90, = 0, (-I#N, (8, )). 
Define, 
mn 
Then it can be shown that if 


E; 


m 
S} (t) = 6, 


£f,N,(8,). 
then Ne), t=0,1,°°°*, satisfies the basic stock equation, 


and from the theoren, 

S,(t) = 6,~ £,N,(0,) + (5, (0)-£,N, (9,))M, y (€). 
Note that the steady state component of the grade 1 stock 
vector grows geometrically ae the same rate as the external 
flows into grade l. 

Define, 

| isl 
B, į (8%) = M O E Peksi m 


Then it can be shown that if 


GOK) 6 By 5 (ON; (0,) 


then ner t=0,1,°°°, satisfies the basic stock equation (1). 
Note that in the limit the stocks in grade i grow geomet- 
rically at the rate cf the largest 8, where k <i. 


Define, 


mar (0. : k=1,***,1). 


M k! 


Sat 





The steady-state component of the stock vector 1S not in 
general a nonnegative combination of the rows of N, (as was 
the case with constant external flows and linear growth of 
external flows). Rather the steady-state stock distribution 
is a nonnegative combination of the rows of N; (9) - The 

rows of N, (0,,) need not be nonnegative combinations of the 
rows of Nos so the limiting stock distributions that are 
possible under geometric growth of external flows need not 

be the same as the limiting stock distributions under constant 


external flows and linear growth of external fiows. 
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IV. THE (GRADE, LOS) MODEL 


The (grade, LOS) model is a model of a graded manpower 
system in which the second characteristic is length of 
service (LOS). By a person's length of service we mean 


the number of periods that he has been in the system. 


A. BACKGROUND 
Various manpower planning models that account for both 
grade and LOS are presently used by the United States military 
services. The incorporation of length of eee into 
military manpower flow models is important in order to 
realistically analyze the policies of enlisted contracts and 
"retirement at 20." 
A person enlisting in the military service "Signs on" 
for a number of years, usually two to five years. In the 
past, approximately eighty percent of those entering enlisted 
military service left the system upon expiration of their 
mettial contract. This high attrition at initial contract 
expiration has had many effects, but two that are pertinent 
here are: 
1) There is a close relation between attrition and length 
of service for men serving on their initial contract. 
2) At least half the people in the system are serving 
under their initial contract, so it is generally quite impor- 


tant that this group be modeled accurately. 
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The inclination of the operations researcher may then be 
to model manpower flows by a Markov-type model in which the 
states are lengths of service. Such models are structurally 
simple and computationally efficient. The author has worked 
on interactive retraining models of this sort for the U.S. 
Marine Corps. See also Grinold, Marshall and Oliver, 1973. 
But in practice manpower flow models are typically only part, 
albeit a crucial part, of larger planning models. Budget 
planners and operational planners usually demand manpower 
projections aggregated by pay grade; the distribution of 
people by their length of service is usually of secondary 
interest. 

Faced with the demand for a model that aggregates by 
pay grade and knowing that a valid model must treat the 
effects of length of service the operations researcher is 
led to conclude that a (grade, LOS) type of model is 
appropriate. 

The foregoing discussion does not imply that the (grade, 
LOS) model is appropriate only for military organizations. 
Various members of the "English school" of manpower planning 
have investigated the appropriate distribution function for 
the length of time that a worker stays with a company 
[Silcock, 1954; Lane and Andrew, 1955; Bartholomew, 1959]. 
The constant failure rate distributions (exponential/ 
geometric) were found to be guite inadequate, indicating that 


attrition rate and length of service are indeed related. 
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B. DEFINITIONS AND DESCRIPTION OF THE MODEL 

Let a person's LOS be the number of times the person has 
been counted in the system. Recall So under our timing 
convention a person who enters the system during period t 
is counted for the first time at the end of that period (at 
time t), and 1S assigned an LOS of 1 at that time. A 
person's LOS increases by one for each successive end-of- 
period that he is counted in the system. 

It 1s assumed that once a person leaves the system he 
never returns. It is possible to modify the (grade, LOS) 
model to allow for departure and re-entry by assigning to 
each grade dummy states in which the LOS remains constant 
from one period to the next. In the model described here 
the "out of the system" state is treated as absorbing. 

The states of the system are defined by couples (1,)) 
where: 

(i,j) = the state corresponding to grade i and LOS j. 
mnie notation and results for the general two-characteristic 
model apply directly to the LOS model. In particular 
l(i) and u(i) are the lower and upper lengths of service for 
anyone in grade i. 

By definition as long as a person remains in the system 
his LOS must increase by one each period. (This is an 
example of a model in which all transient states have the 0-1 
visiting property.) Consequently, we need define only the 
following transition probabilities: 


ee probability a person in state (i,j) at the end 
J 
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of one period will be in state (i,j+tl) at the 
end of the next period, 

Ps = probability a person in state (i,j) at the end 
of one period will be in state (i+tl,j+tl) at the 
end of the next period, 

ajj = probability a person in state (i,j) at the end of 

one period will be out of the system at the end 
of the next period. 


By assumption Al: 


ore + ae + aij = l. 


The transition matrix A; is a Ws x l matrix: 


ee me ay" a, uli)? 


The transition matrix O; is w. x w. and has non-zero 


I l 


elements only immediately above the main diagonal: 


1) 
E a) + 
Rn (12) 
O; = 0 A 
PS a 
N di u(i)-1 
0 
The transition matrix P; is W, X Wii, and has non-zero 


elements only on a single diagonal band. If el 


and u(itl) >u(i)+tl, then P. has the form shown below, where: 


1) the first max (0,1(1+1)-(1(i)+D) rows arẹ zeroes 


2) the last max (0,u(i+1)-(u(i)+1)) columns are zeroes. 
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Thus, 


0 
0 
{ 
Í 
l : 
pP, = Peete) ) = 1 , (2) 
> De 
a, en 
se 
Pi ulıy 0 me 


Pewee) < 1(1), the first 1(1)+1-1(1+1) columns of P; 
Eu zeroes.  —1£ u(i+l) <u(i), the last u(ij+1-u(3+1) rows 
of Pp, are zeroes. Under any circumstances P; is a 


"diagonal matrix," (see Appendix B). 


Define, 
r+k-1 
q, (1 ,1+k) = e dij if k>l, 
j=r 
E 0 
= 10> 1E ESO. 


Then for x=0,1,+**,w,-1 we have 
O... 0 q, (1,1+k) 


0 q; (1+1,1+1+k 
k FR -s 
o. = o 
Hd ~ 
a q, (u-k,u) , 
0 
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where 1 = ee), 


u Der), 
and the first k columns and the last k rows are zeroes. 


k 


When k> Wi, O; =0. Recalling that 
_ u al 
+ . k 
zZ y O; , 
k=0 


we have in the LOS model, 


w¿=1 
N=] 0 
k=0 
I q; (1,141 q; (1,142) a g,(1,u) 
Ma a q,(1+1,1+2) +: q,; (1+1,u) 
E (3) 
q; (u-l,u) ' 
als 


The notation hides the rather simple structure of the 
fundamental matrix N. in the (grade, LOS) model. Consider 


the case where 1(i)=1 and u(i)=4. We then have for grade ii, 


I 1 Ya 94194 2953 
4 en 912913 
N, = : (4) 
di3 
1 


It is interesting to compare the structure of N. in the 
(grade, LOS) model with the structure of a matrix, denoted 
B, of the submatrices Bay from the general two-characteristic 


model: 
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B B B B 


11 12 18 14 
a ER. E 
B= 
eae B34 
Baa 


From Section B of Chapter II, 


k-1 
B., = II B., aw. xw 
i 1 


ma LX 
E rix; 
r=1 


k 


so the matrix B can be written, 


A E Bo Dabo; 
I B, BSB, 

B = (5) 
I B 


Recall that each element of B- gives the probability of 
entering grade itl in a particular state given the state in 
Which grade i was entered. Thus, the matrices B; summarize 
information about transitions from one grade to the next 

just as the A summarize information about transitions 

from one LOS to the next. In Chapter VI we consider combining 
states in a two-characteristic model; the foregoing discussion 
indicates that if all states in a grade are combined then the 
information originally summarized in the W, X Wij] matrix B; 
must now be summarized by a single a i.e. a Ixi matrix 
B; in the model that results from combining states. 


In manpower modeling the term cohort usually refers to a 


group of people who enter the system at the same time and in 
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the same state. For example the group of freshmen entering 
a college in a particular year comprise a cohort. After 
the cchort has entered the system we use the term cohort to 
refer to the members who are still in the system. 

Let us consider the movements of cohorts in the (grade, 
LOS) model when entry to the system is restricted to state 
(1,1). The cohort which enters during period ty must, at any 


subsequent time t, +t, be in the set of states (i,j) such 


1 
that j=t+1. Furthermore any cohort which entered the system 


during period t, # t, cannot at time t, +t be in any of the 


1 
states (i, t+1). Thus when entry to the system is restricted 
to a particular state, e.g., State (l,l), the (grade, LOS) 
model preserves cohorts in the sense that members of different 
cohorts never simultaneously occupy the same state. The motion 
of a cohort through the system is clearly described by the 
t-step transition matrices. 
Recall that the t-step transition matrix from states in 
grade k to states in grade i, M; (t); has elements: 
M(t; (k,m), (i,j) = probability of being in state (i,)) 
t steps after being in state (k,m). 
The row index of M; (t) is r = m - l(k) + l; the column index 
of M; (t) is c = j- l(i) +l. Because LOS increases by one 
each period, if 
am), (i,j)) 70, 
then j-m= t. 


But j- m= t implies that 


(c + 1(1) -1) - (r +1(k) -1) Er OE 


Cc - r t + (1(k) - 1(1)) . 


60 





Thus the non-zero elements of Mo; are in row r and column c 
such that 

c= r = t + (1(k) A Y (6) 
SO M; (t) is a "diagonal matrix" with index t+(1(k)-1(i)). 
(See Appendix B.) Furthermore the non-zero diagonal of 
M3 (ttl) must be immediately above the non-zero diagonal of 


M,; (t)- Recalling that, 


00 
l KE "= vr (see equation (8) or Chapter II), 


we see that the non-zero diagonal of M; (t) must be the 
set of elements of Vki in row r and column c satisfying (6). 


Thus in the (grade, LOS) model all t-step transition matrices 


can be readily derived from the matrices Vki’ 


C. COMPUTATIONAL CONSIDERATIONS 

Because of the sparseness of the O: and P. matrices it 
would be most inefficient to store and manipulate the entire 
matrices. In Appendix B the storage and multiplication of 
"diagonal matrices" is discussed. A "diagonal matrix" is 
defined as any matrix (not necessarily square) having the 
property that for some k the element in row i and column j 
equals zero if i-j#k. The matrices O; and P: in tne (grade, 
LOS) model are diagonal matrices. 

It is shown in Appendix B that the product of diagonal 
matrices is a diagonal matrix. Only ane elements of the non- 
zero diagonal and four numbers describing the diagonal 
matrix need be stored. To multiply an r xm diagonal matrix 


by an mxc diagonal matrix requires min{r,m,c} multiplications 
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as opposed to the product of r,m and c multiplications and 
additions in the general case. Consequently, in the (grade, 
LOS) model the fundamental matrix N, can be computed 
efficiently by computing the non-zero diagonals of successive 
powers of O; and combining these with an identity matrix to 


form the fundamental matrix N;- 


D. EXAMPLE: MINIMIZATION OF COSTS AT REENLISTMENT 

In this section we consider an example in order to 
illustrate how one might take advantage of the highly struc- 
tured form of the (grade, LOS) model to solve a practical 
problem. 

The U.S. Navy has used various reenlistment bonus plans to 
decrease the attrition rate at termination of the initial 
enlistment contract. The amount and effectiveness of the 
bonus depends on a number of factors; among them is pay grade 
at contract termination. 

We consider the case in which initial contracts are for m 
periods, and a reenlistment bonus is used to control the 
attrition rate a for such grades k that a is defined. 


km km 
To simplify notation it is assumed that A is defined for 
—o ates The cost of changing Am is specifieä by a non- 
negative quasi-convex function. Let C, (arm be the cost of 
a Alim decrease in Alem from its base value, and let Cy (0) = 0. 
Decisions are constrained by the requirements that in 
equilibrium the total stocks in each grade, s.1, must be no 
smaller than a specified lower bound S, All external flow 


is into grade 1 and f, is specified. 


il 
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Because under Al, 
Pim + Alem? ae = 1; 

uf apm is decreased then changes in either Pum OF akm must 
Sccur. 


Let, 


Adium = the increase in Iym ? 


APın = the increase in Prem’ 


Then, 
Adum + AP Ln = Na, n (7) 
The various transition matrices depend on the values of 
the Adem S and AP Ss and we denote this by Q, (Aq) 
Ny, (Adv) ’ P, (Ap, .) and By (AG jo) > To simplify notation some- 


what we define, 


O, = Q, (0), 
= ( 
N, N, 10), N 
By = By (0,0). 
The parameters of the matrices O, and PL are known, and N, 
and By are calculated from them. 
From the results of Chapter III we have 
i-l 
Shei £, N BL (Aq) 7 AP im) Ny (Adm) > (8) 
r=] 
In the case Aim = AP = Aa = 0 for all k we denote the 
stocks by Si 9 and, 
ne 
= | 9 
Sio £, Ji B N; (9) 
r=l 
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To minimize the total costs of the reenlistment bonuses 
while satisfying the constraint that total stocks in grade i 
be no smaller than S; we must solve the following program 


denoted P1. 


n 
Pl) min } Cy (Aa 


) 
k=1 alk 
Stes ses, , 1 = 1,:*:9,n 
i —`i 
Jol 
= (Acq. = l,» 
Ss; Y Pr dm Ap m Ni aim ” 1 1% en 


Adin + AP ym = Aart A n 

It will be shown that because of the structure of the 
(grade, LOS) model the constraints imposed by equation (1) 
are linear functions of the decision variables in program 
Pl. That is, program Pl requires the minimization of a quasi- 
convex function subject to linear constraints. 

The linearity of the constraints is shown in two steps. 
First, we show that they are linear when only one of the 
decision variables is non-zero. Next, we show they are linear 


for all feasible values of the decision variables. 


When Ady AP Em 48 70 we denote the values of the 


parameters Quem’ Pkm and Arn with hats, i.e., 


Tkm y dm’ 


a + APım ’ 
Em pS bam , 


akm 


The value of Si depends on Akm only through N, (89m) ° 
Let, 


j =m —- l(k) +1. 
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Then the only elements of N, (Aq, ) that are functions of 
Ikm are those in row r and column c such that r<j and 
c>j. Each of these elements has Im as a factor. Let 


N, be the Wi X Wk matrix that results from: 


1) setting dkm equal to one in those elements of Ny in 
which it is a factor, 
2) setting all other elements of Ny equal to zero. 
For illustration let 1(k)=1, u(k)=5 and m=3. Then, 


j=3-1+1=3. The fundamental matrix is 


| 
NGq = |) Ay Wg Ike ¡a 91% 2%3 1x4 
1 To | W293 TIA 34 

| 
l IK3 IK3Ika 
ae eee 0 oa 
1 Tk 

1 
ke 


where G43 = qdk3 => Adiz : 


The elements having dj3 as a factor are in the indicated 
Submatrix in the upper right corner of K, (Aq. ) ° Setting 
dk3 equal to one in these elements and setting the other 


elements equal to zero we then have 


GROS I a 
> 4x2 1294 
Nn ={0 00 1 ay 
0 0.0 0 0 j 
0 oO 0 0 0 
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From this illustration we observe that 
= m 
N, (Aq, m? = N, + Ag mx ; 
Note that Ne is not a function of q 2 
km 
Thus, if Adim Pim amd fonn- D n except that 


Ad m* 08 then the stocks in grade i are from (8) and (9) 


k-1 A i-l 
s.(Aq,_) = £,( I B_) (N,+4q,_N,)P, HI BUN. 
1 km l ER er: k km k Se Te 
k-1 ha i-l 
= otro L (I B)(N P.) I BN... 
10 km l En je k k epr EJ 
Define, 
k-1 a i-1 p 
dix = £, ( E B_) (N, Py) 1 B_N.1l, a scalar. >- (10) 
r=1 r=k+1 


Note that diz ls not a function of Gum OF of any Adin 

7 y T t 
or ÂP m" So we have shown that when all Adam s and APım S 
except Adin are zero then the total stocks in grade 1 are a 


linear function of AQ ym * 


Si dim) t = Sig + Fix dem 
We now show the total stocks in grade 1 are also linear 


in AP Lan when all other Aqy,p S and APam S are zero. 


The value of Si depends on AB only through P, (Ap, ,) - 


Let Ba be the matrix that results from setting Diem equal 


Po one: in PL (Ap) and setting all other elements of Py (AP, n) 


equal to zero. Then 
= p m 
Pi (Ap, _) oe) ot AB. 
_ ra 
= Py + AP, Py : 
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Now if Ag, m AP m lam? for l=1,°*>,nwex@ept Pum? 0+ 


then the stocks in grade i are 


Ka isd 
s, (Ap, ,) = £ (2 », IND 2. Ber 
k-1 mil 
= Sio + APimt y | y ) N, P k AN 
Define, 
k-1 tal $ 
ey, =f os JN, Py ie E (I 


Note that e., is not a function of Prem OF of any Adam or 
APs mn ; 
' t 
So we have shown that when all Adam s and APım s except AP, m 


are zero then the total stocks in grade i are a linear 


function of AP! 


S; (Ap, ,) 1 = Sio + Six AP * 


We call u the m-differential matrix of Ny and a the 


m-differential matrix of Pye 


The following lemma is used to show that when more than 
one Ag, m or AP Ln (or both) are non-zero, the total stocks are 
still linear functions of the Ag m s and APL in S 

tol 
Lemma. If in the product IB N, where B,=N,P any 


ES] k k kk 


two (or more) distinct matrices are replaced by their m- 
differential matrices, then the product is a zero matrix. 
Proof. Because all states have the 0-1 visiting property 
i-l 
in the (grade, LOS) model, IB N, =V] j? the wW X wW; matrix 


De E 
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of probabilities of visiting a column state in grade i 
starting from a row state in grade l. Each of these proba- 
bilities is the sum of the probabilities of all paths 
through the state space which start in the row state and end 
in the column state. 

Replacing O. DY Im times its m-differential matrix o 


is equivalent to extracting all terms in V which have dim 


Ji 
as a factor. This in turn is equivalent to restricting the 
paths from the row state to the column state to those which 
pass through both states (k,m) and (k,m+1). Similarly, 
replacing Py by Pim times its m-differential matrix ae is 
equivalent to restricting the paths from the row state to the 
column state to those paths which pass through both states 
(k,m) and (k+1,m+1). Consequently, replacing two or more 

Dt 


matrices in H B N,=V 


k by their m-differential matrices 
k=] 


37 


restricts the process to paths from the row state to the 
column state that have probability zero. O 
Thus we have. 
1-1 


Ey MP! Perm Ni Adim? 


un 
N 


sr wW raag NM (Pp rap PU (n.+Aq. N”) 
Sy r dem E r Prm Y i “dimi ° 


if 
Hh 


From the lemma all of the "cross product" terms, 1.e., terms 
containing two or more m-differential matrices, are zero 


Matrices, so we have, 
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1-1 k-1 i-1 
INTE + T Aa f, CT BON P (I BN, 


A A 
un k-1 i-l 
+ Ap (1 B)N TEEN 
= lan 1 r=] Y Krk mk © 1 


and using (9), (10) and (11), 
_ icl 
Pl s. l+ È tanik + pl Pix * 
This completes our demonstration that the constraints 


are linear in the decision variables. Thus the following 


program P2 is equivalent to the original program Pl: 
n 
P2] min 1 Cy (Ady t AP jem) 


i i-1 

ST i day nda, + l DD ee Sletten 
k=] 1=1 

The constraints need not be of the exact form shown in 


this example. The nature of the problem does not change if 


the constraints are of the form 
S.L. aS. y i=1,--»-,n 


where L; is a Wi X C; matrix, 


S. is ac. x L vector, 
i i 
and Cc. is the number of constraints on stocks in grade 1. 
The scalars dik and Six would in this case become ql x Ci 
vectors, and their defining equation would be modified by 
replacing 1 with L,- 
There are various efficient techniques for minimizing 


a nonlinear objective function subject to linear constraints 


which could be used to solve P2. The exact form of the 
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objective function might indiate the most appropriate 
technique, e.g., separable programming, quadratic programming 
Or gradient projection. The point of the example, however, 
is to show that one may take advantage of the structure of 
the (grade, LOS) model to solve problems that would be quite 


difficult in a general two-characteristic model. 


E. INTERCHANGING GRADE AND LOS 

When studying manpower flows with a two-characteristic 
model it 1S sometimes advantageous to interchange the first 
characteristic, grade, and the secondary characteristic. The 
feasibility of this depends to a great extent on whether the 
interchanged characteristic model satisfies assumption Al. 
That is, in the original two-characteristic model, is the 
value of the second characteristic restricted tc staying the 
same or increasing by one in each period? 

In the case of the (grade, LOS) model interchanging grade 
and LOS does result in a model which satisfies assumption Al. 
We consider some of the properties of this interchanged model 
which we call the (LOS, grade) model. 

Before interchanging characteristics it is convenient to 
expand the state space as necessary to insure that for each 
value of grade i and each value of LOS j there is a state 
(i,j). Recall that in the (grade, LOS) model we only define 
mesures (1,7) when 1(1)<3<u(i). Let u be the largest of 
the u(i)'s. Then in the expanded state space (i,j) is 


Beemmed for 1<j<u for i=l,*-+,n. 
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Now let us consider the structure of the submatrices 
in the overall transition matrix for the (LOS, grade) model. 
For any LOS j the n x n matrix Q; contains the transition 
probabilities for going from one grade to another while 
maintaining LOS constant at j. By the definition of LOS it 


must increase by one each period, so such probabilities are 


zero. Consequently, for j=l,°°*°,u, 
. = 0 
Q; r 
and, 
-1 
N. = (I-Q. = I. 
j ( Q.) 


For any LOS j the n x n matrix E contains the probabilities 
of going from one pay grade to another (or the same pay grade) 
while increasing LOS to j+1. It is here that we can see a 
possible advantage of the (LOS, grade) model since we can 
allow demotions and multi-grade promotions without violating 
assumption Al. If there are no multi-grade promotions and 
there are no demotions, then P) has non-zero entries only on 
and immediately above the main diagonal. If demotions are 
included in the model they cause non-zero entries below the 
main diagonal of 2y promotions of any type are reflected in 
non-zero entries above the main diagonal of BES Under any 


circumstances, 


If demotions are not included in the model, then By is upper 
triangular and so is the product of successively indexed 


matrices er 
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For any LOS j the n x 1 matrix Aj contains the proba- 
bilities of leaving the system from the various pay grades 
when length of service is j. 

The overall transition matrix P for the (LOS, grade) 
model has the same form as in any secondary characteristic 


model. The important difference is that in the (LOS, grade) 


model we have Q,=0. Thus, 


0 P 


1 A 
O W 

P = po An | 

= <> 
eN = a 

0 A 
AAA | E 

0 m 


To illustrate the effects of interchanging grade and LOS 
consider an example from the U.S. Marine Corps. The enlisted 
force of the Marine Corps has nine grades. The values of LOS 
range from 1 to 30. So in the (grade, LOS) model we would 
have nine 30x30 matrices O; and eight 30x30 matrices P.. 

Of course each of these matrices would have at most 30 non- 
zero elements. By interchanging characteristics, the (LOS, 
grade) model has all. matrices O; equal to zero matrices, and 
there are twenty-nine 9x9 matrices P.. Whether it is more 
practical to use nine pairs of relatively large matrices or 
twenty-nine relatively small matrices must depend on the 
problem to be solved. This example does illustrate that the 


modeler has a choice. 
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The fundamental matrix for 


process is, 


P P3 nn 


The expansion of the state 
we): =), “u , i=l,---,n} 


necessary. 


1(7) 


u(j) 
As before w¿=u (3-1 (3)+1. 
w3 x Wir and A, 
Substantially arfected. 


is WwW. X l. 
J 


reduced but computer 
We note that the 


equation (1)) in the 


a = 


Then 
Q) 


(LOS, grade) 


f.(t+l) +s. eye. 
5 § ) Sy ) j- 


the transient part of the 


Wage 
FU 


Li. 
Es 


Il AG 
FU 


LJ. 
N 
t= 


LW 
LJ 


tu 
H --- FG 
FU 


space so that all states in 


are defined is not always 


In the (LOS, grade) model one may define, 
lowest grade that a person having LOS j may hold, 


highest grade that a person having LOS j may hold. 


is W. X W., P. is 
J J J 


The nature of the model is not 
Computer storage requirements may be 
programming may be more complex. 


basic stock equation (see Chapter III, 


model is 


1 , 


reflecting the fact that the stocks of LOS j in the next 


period do not depend on the stocks of LOS j in the present 


period. 


III) is the same as the basic stock equation. 


The cumulative stock equation (equation (2), Chapter 


The following 


equation is derived from the fundamental matrix N3 and gives 
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the stocks in terms of the external flows in previous periods, 


: aa 
s.(t) = y f (t-m+j) I P, . 

J m=1 ™ k=m $ 

In summary the choice between the (grade, LOS) or the 


(LOS, grade) model depends on the information that is to be 


derived from the model. The (LOS, grade) model is more 


flexible in that the promotion/demotion structure is not 


restricted. The (grade, LOS) model might be more tractable 


when used as part of a budget model since costs are usually 


dependent on grade rather than on LOS. Under any circumstances 


the two models must yield equivalent information. 
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V. THE (GRADE, TIG) MODEL 


The (grade, TIG) model is a model of a graded manpower 
system in which the second characteristic is time in grade 
(TIG). By time in grade we mean the number of periods that 


a person has been in his present grade. 


A. BACKGROUND 

There are occasions when it is desired to model the 
flows of "career motivated" people in a manpower system. 

The operational definition of a career motivated person 
depends on the case at hand, but generally we expect such 
people to have no predilection to leave the system-at the 
earliest convenient opportunity. Rather we expect the 
decisions of career motivated people with respect to leaving 
the system to be closely related to ie recognition they 
receive for their performance in the system. One measure of 
recognition of performance is the combination of pay grade 
and time since last promotion. Such considerations may make 
a (grade, TIG) model appropriate. 

A somewhat different context in which the (grade, TIG) 
model may be appropriate is that in which the organization 
has an "up or out"policy. Positions in an organization may 
be (formally or informally) partitioned between those that 
are on the "path towards the top" and those that are 
terminal positions. The former group often has an up or out 


policy where "out" means transfer to a terminal position; the 





(grade, TIG) model can be used to examine flows in positions 


in this group. 


B. DESCRIPTION 

A person who enters grade i during period t is counted 
for the first time in grade DAN the end of the period (at 
time t) and assigned a TIG of one at that time. The 
person's TIG increases by one for each successive period that 
he is counted in grade i. 

The states of the system are defined by couples (i,j) 

where: | 

(i,j) = the state corresponding to građe i and TIG j. 

The notation and results for the two-characteristic model 
apply directly to the (grade, TIG) model. In the (grade, 
TIG) model. In the (grade, TIG) model the value of 1(i) is 
always one. 

We need define only the following probabilities: 

dij = probability a person in state (i,j) at the end 
of one period wll be in state (i,j+l) at the 
end of the next period. 

Pi; = probability a person in state (i,j) at the end of 
one period will be in state (i+1,1) at the end of 
the next period. 

ass = probability a person in state (i,j) at the end of 

one period will be out of the system at the end 
of the next period. 


Under assumption Al, 


ds + oe + a = l. 
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The transition matrix A; is a Ws x l matrix: 
A 


ene oa 


The transition matrix O; is Wi X Wi and has non-zero elements 


only immediately above the main diagonal: 


oa 


90. Mi 
N 

A 0 
ye 


di, ,u(i)-1 
EN 


0 


Ss 
~ 


The transition matrix pP, is Wi x Wi and has non-zero elements 


only in the first column: 


Pil 


O = — me (>) 


0 
| 
P. =| | | 
| i 

| Pi u(i) O 
Thus, A; and O; have the same form as in the (grade, LOS) 


model. Each fundamental matris N, also has the same form 


as in the (grade, LOS) model: 


u(i)-1 
ee ee dij 
J=1 
u-la 
a I A = 
m i 
oe t 
l 


1 





The matrix B,=N.P, has non-zero elements only in the 
first column because grade i+l can only be entered in state 
(i+1,1). Let the element in row j and column 1 of B; be 
denoted one Then, 

uk- 
AI mbar - 
Recall that B.1 is a w. x I vector, and ua | component 
is the probability of attaining grade i+l starting from state 


(i,j). In the (grade, TIG) model, 


B 1 = eae ae ’ 


SO, 


Des 


i5 probability of attaining grade i+l from state (i,j). 


The product of successively indexed Bs matrices has non- 


zero elements only in the first column, and 


k-1 
ll b A y 
E aL 
k-1 k-1 
IB = Des | b Oel 
mu m 12 itl ml ' | 
| | 
k-1 
low ox R b a = 20 
1,u(1) 447 ml 
k-1] Z 
= II D aB l; 0] 
m=i+1 mia 
Or 9,50: then state (i,j+1) is unreachable (except 


possibly through an external flow). If Tiare, then, 
o 1) 
Bel a. ° 
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This relation provides an efficient method for computing 


the elements of the vector B,1 recursively: 


Fr, Band): 


E = ER + A ee iu) e, LO 

The (grade, TIG) model is equivalent to a discrete semi- 
Markov process. The states of the process are the n grades 
and To (out of the system). Upon entering state i the next 


transition is to state i+tl with probability bs and" toni 


L 0 


with probability l-b, The tail distribution of the time 


1* 
spent in grade 1 is the first row of the fundamental matrix 
N;- The results from the two-characteristic model in 

Chapter II, Section E can be used to find the distributions 

of Me spent in state 1 conditioned on either promotion 

or leaving the system before promotion. Thus all the infor- 
mation required to set up the (grade, TIG) model as a discrete 
semi-Markov process is readily derived from the results 
established here. The use of the discrete semi-Markov process 
1s covered in some detail in Howard, 1971. Because of 
assumption Al there seems to be little advantage to treating 
the (grade, TIG) model as a semi-Markov process. If it is 
necessary that demotions and multi-step promotions be 

included in a model based on grade and time in grade, then 

the more general technigues of semi-Markov processes might be 
of value. We show in the following section, however, that 
interchanging characteristics in the (grade, TIG) model 

leads to a Markov model which allows the inclusion of 


demotions and multi-step promotions. This model is 
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computationally practical, but violates assumption Al, and 
thus 1S not in the class of models analyzed in Chapters II 


and III. 


C. INTERCHANGING GRADE AND TIG 

Interchanging grade and time in grade results in a model 
that does not satisfy assumption Al (the new first character- 
istic, time in-grade, decreases when the new second character- 
istic, grade, changes). Nevertheless, the (TIG, grade) model 
1S not too difficult to analyze, and it enables the modeler to 
provide for demotions and multi-grade promotions. 

Let (3,1) denote the state in the (TIG, grade) model 
corresponding to j periods in grade 1. As before i is in the 
set (ee and we will let j take any value from the set 
mr m} where the value of m is specified. Let P be an 
nxn matrix of transition probabilities for transitions from 
States in {(j,1):i=1,:--,n} to states in {(1,k):k=1,---:,n}. 
That is, A is the transition matrix for transitions from one 
grade to a different grade when TIG is equal to j. Changing 
grades, Of course, causes the value of TIG to change to one. 
The matrix E has zeroes on itS Main diagonal. Let Oy be an 
nxn matrix of transition probabilities for transitions from 
Bestest a) a= 1, + nó to states in [(3+1,1):i=1,:->,n), 
That is, Q; represents transitions in which grade is unchanged 
and TIG increases from j to j+l. The E O, has zeroes 
everywhere except possibly on its main diagonal. Let 2 be 
an nxl matrix of transition probabilities for transitions 


Mon states in {(j,1):i=l,°--,n} to out of the system. The 
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overall transition matrix P for the (TIG, grade) model is 


shown below: 


i | | I 
PS 2 y 22 
i ~ 
t N l ; 
A : = | 
a ei Qn-1 | eal j 
P A 
m | m 
0 lı 


The transient part of the process is represented by the 
submatrices in the upper-left part of P. We call this (as 
before) the transient matrix Q. By assumption A2 the 
process is transient, so the fundamental matrix, Nero) 
exists. Of particular interest is the nxn submatrix in 
the upper-left corner of N which we will denote by Nui: The 
rows and cclumns of Noy correspond to states in 
{(1,i):i=1,°**,n}. The basic theorem on the fundamental 
matrix [Theorem 3.2.4, Kemeny and Snell, 1960] indicates that 


the element in row 1 and column k of N equals the expected 


11 
number of visits to state (1,k) starting from state (1,3). If 
the 0-1 visiting property holds (which in the (TIG, grade) 
model means no demotions), then the element in row 1 and 
column k of N, ] equals the probability of attaining grade K 
given that grade i has just been attained. The submatrix N] 


may be computed in the following manner. 


ker, 

F =p, 

m m 

= = pP, + O j=l, --,;m-l, an nxn matrix. 
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The element in row i and column k of o is the probability 
of ever entering state (1,k) starting from state (j,i). It 
can be shown that, 

A 
and that the first column of submatrices of N, an mnxn 
matrix, 1S [N, 7 + FNyy rro o+F,Ny3] - Most of the first-order 
information of interest from the (TIG, grade) model is either 
contained in or readily derived from the PER column of sub- 
matrices of the fundamental matrix N. 

In summary, interchanging characteristics in the (grade, 
TIG) model leads to a model having a transient matrix struc- 
ture quite different from that of the two-characteristic model 
as developed in Chapter II. However, first-order properties 
of the interchanged model are not too difficult to obtain. 

The most obvious advantage of interchanging characteristics 


1s that demotions and multi-grade promotions may be included 


in the model. 
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VI. COMBINING STATES 


In this chapter we consider some of the mathematical 
properties of the two-characteristic model when the states 
in each grade are combined into a single state. The ideas 
discussed here are closely related to the concept of lumpa- 
bility as presented in Burke and Rosenblatt, 1958, and 
Kemeny and Snell, 1960. There is a difference between the 
approach taken here and that of the foregoing authors. Burke, 
et al. are primarily interested in establishing conditions 
under which the combining of states in a stationary Markov 
process leads to a process which is still Markov and sta- 
tionary and has these tiles for all (or at least some) 
initial probability vectors. In Section A of this chapter 
we briefly consider conditions under which the states of a 
two-characteristic model are lumpable. In Section B of this 
chapter we will consider combining states with the clear 
understanding that the resulting process may not be Markov 
or stationary for most initial probability vectors. Section 
C contains an example which illustrates the results in 
Fection B. 

We restrict our attention to the combining of all states 
in each grade so that the "reduced state space" version of 
the two-characteristic model will have one state for each 
grade and a state for "out of the system." Obviously such a 


combining of states leads to a one-characteristic model. 
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A. LUMPABILITY 
Let $¢(m) be the random function which indicates the state 
that a Markov process is in after m steps. Recall that T: is 
the set of all states in grade i. Let mt be an initial proba- 
bility vector over all states in the system. Then let 
(1) Pr [9(0)eT,] = probability the process starts in Ti 
given the initial probability vector 
T, 
(2) Pr [ö(mtl)eT,|d(m)eT,,""",6(1)ET,, 9(0)eT,] 
= probability the process is in Ti after m+1 
steps, given the initial probability vector 
T and the events o(O)eT., 
(1) eT, +++, g(m)eT.. 
The latter probability is not defined unless the given sequence 
of events has positive probability under the initial proba- 
bility vector T. 
Let T* denote the partition of the state space TUT, into 
the sets of states, Tor Tartt eTe The foregoing proba- 
bilities (1) and (2) define a stochastic process on T*, and we 


call this a lumped process. 


Definition. We say that the Markov chain in a two- 
characteristic model is "lumpable" with respect to the 
partition T* if for every starting vector m the lumped process 
defined above is a Markov chain and the transition proba- 


bilities do not depend on the choice of m. 


Theorem. A necessary and sufficient condition for the Markov 


chain in a two-characteristic model to be lumpable with respect 
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to T* is that for every pair of sets in T*, e.g., T; and The 
the probability of a one-step transition from any state 
(i,j) eT; to some state in Ty has the same value for every 
state in T;- These common values, denoted by da form a 
transition matrix for the lumped chain. 

The foregoing definition and theorem are taken from 
Chapter VI of Kemeny and Snell, 1960, with paraphrasing 
to suit the case at hand. 


By assumption Al for any starting vector , 
Pr_ [o(m+1)eT, |ó(m)eT_,+*+,$(1)eT,, p(0)eT,] 


can be non-zero only if t=s,stl, or 0. 


We define the W. X 3 matrix Pp. by , 


P; = [Q.1, P A.l]. (1) 


Corollary 1. The Markov chain in a two-characteristic model 
is lumpable with respect to T* if and only if for each grade 
i=1l,°°°,n the matrix p has identical rows. If the chain 


is lumpable, then the transition matrix for the chain is 


Pry Pip | Pro 
Pao P23 | Bao 
$ = Zr TT. 
A a Pn-1,0 
k- Pan | no 
0 © d 


where the vector (p. ) is any one of the identical 


ji’Pi,i+1’Pio 


rows of P;- 





Bovollany 2. A necessary condition for the Markov chain in 
a (grace, LOS) model or a (grade, TIG) model to be lumpable 


is that Q.=0, I, een. 


Proof. In the (grade, LOS) and (grade, TIG) models g. 0 


a O 
Ps gefinition. So, 

1) the last component of 0,1 1S a zero, implying 

2) the rows of Pia are not identical unless q,1=0, mplying, 

3) the rows of pP. are not identical unless Q.=0, implying 
w Corollary 1l, 


4) the chain is not lumpable with respect to T* unless 


0), A [j 


In the (grade, LOS) and (grade, TIG) models if 0.=0, 
1=l,-*+*,n, the lumped process is rather trivial because the 
only sequences of events that have positive probability are 
those of the form P(0)ET¡/H(1)ET ¡1 oe rólm)eT, , or such a 


sequence with events d(m+k) eT k=1,"***,Y appended. If the 


0 
starting vector T has positive probability only on T: or we 
are given that o(O)eT., then the lumped process is Markovian 


but the transition probabilities depend on the value of T, a 


Situation Kemeny and Snell call "weak lumpability." 


B. LUMPING AN UNLUMPABLE PROCESS 

It may be necessary or advantageous in some circumstances 
to use a one-characteristic model. If the system is more 
accurately modeled by a particular two-characteristic model, 
then we are faced with the problem of lumping a process which 


may be mathematically unlumpable. In this section we develop 
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methods for qualitatively judging what parameters might be 
appropriate when lumping a two-characteristic model. It is 
convenient to abandon the stochastic interpretation of the 
model, and adopt a deterministic fractional flow viewpoint 
of the process. 


A 


Recall equation (1), Pp; = 0,1, Pjl, A.1], a Wi x 3 
matrix. In this section we assume that the lumpability 
condition is not met, i.e., the rows of P. are not identical. 
We are interested in the following fractions for each grade 
i=1,°**,n: 

T) te fraction of those in građe i at t who remain 
in građe i for one more period, 

2) the fraction of those in građe i at t who get promoted 
to grade i+l during period t+], 

3) the fraction of those in grade i at t who leave the 
system during period ttl. 
The values of these fractions depend on the distribution of 


the stocks in grade i at time t. 


For any stock vector s; (t) £ 0, we define, 


a s. (t) 
s. (t) = ——, alxw. distribution vector. (2) 
i = F 
Sa (ETI 
Then, 
s.(t)0.1 s. (t)P.1  s.(t)A.l 
S, (t)P, = Ad ; I _ t1 i , a lix 3 vector, 


ao Ss. (t)1 „S; (t)l 
Define the following functions which map distribution vectors 


into scalars: 
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q; (5; (t)) EoI, 
D, (s,(t)) = s, (t)P.1, 

N E Ber 
a, (s; (t)) E S,(t)A,1. 


If the vector of stocks in grade i at time t is S, (t) #0 


and s,(t) is defined by equation (2), then, 


q, (5, (t)) the fraction of those in grade i at t who 


remain in grade i for one more period, 


% : i 
p; (s; (t)) the fraction of those in grade i at t who 


get promoted to grade i+l during period t+l, 


% i . l 
a,(s,(t)) the fraction of those in grade i at t who 
leave the system during period t+l. 


We note that, 
q. (8, (t)) +p, (8, (t)) 4a, (8. (t)) = 1. 


Define the following function which maps a 1x Wi 
distribution vector into a 1 x 3 distribution vector: 


N nn 
P.(s,(t)) 


I! 


N, AN, 
S; (t)P; 


I 


N, mo v E 
A eC (3) 
If the stock distribution vectors for grade i, Sauce) were 
known for t=0,1,2,***, then one could combine the states in 


09 


grade i and use the sequence <p, (5, (t)> to form a non- 


t=0 


stationary fractional flow model. 

The more pertinent situation is that in which s; (0) is 
known, but we do not know or do not want to compute s, (t) 
and S, (t) o nen. in the absence of information about 
future stock vectors we want to choose the parameters of a 


Stationary lumped process in some reasonable fashion. That 
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is, for each grade i=l,°*°,n we want to select three numbers 
(which we will denote Gir Py and a.) to represent the 
fractions of the people in grade i who stay in grade i, get 
promoted and leave the system respectively each period. The 
lumped process would have a state for each grade and one state 


for out of the system. The transition matrix would be, 


de ET | aT 
42 2 | 02 
e a. ' ) |. 
N ` Pn-1 | ni 
aw E 
0 AA a 


Under the assumption that the lumpability condition is 
not met, there is no choice of parameters for p that will 
in general be equivalent to the unlumped process. We may 
begin, however, by eliminating choices for P that are 
"obviously bad." 

Por any distribution vector AR the 1 x 3 vector 
pP, (8; (t)) is a convex combination of the rows of SE Let Hs 
be the set of all such vectors, i.e., 
3 


lave = s.P., 
IL 


H. = {h.€E 
i i 


1 À T 


In choosing parameters for the lumped process, any choice 


(44, P;r a,) which is not in H, is "obviously bad" because 


i 
here 1S no stock veetor for which this choice reflects the 


behavior of the system. 


89 





1 


The set H. is a subset of the fundamental simplex in 
3-space. 

Clearly (from Corollary 2) the Markov chain of a two- 
characteristic model is lumpable if and only if for each 
grade i=l,-::,n, H. is a single point. A plot of H: gives 
some qualitative indication of just how unlumpable grade i 
actually is. 

Next we consider the behavior of the sequence <P, (5, (t))> 
under various assumptions. 

l. Constant Stocks 

If the stock vector does not change with time, i.e., 


s; (t) ope eit, then the stock distribution vector 


ee 


is constant, 


ii 

E 
Mi 
ne 


Ny 
S; (t) 


un 
I 


and the logical choice of parameters for the lumped process is, 


= Ar 
(djr Ps, a;) = P;(s,) 





a 1 
= A 
= Pi > 
s.]l 
1 


because these parameters exactly reflect the deterministic 
flow behavior of the system. 
2. Convergent Stock Distribution Vector 

If the sequence of stock vectors, <s; (t)>, has the 
property that the corresponding sequence of stock distribution 
vectors, <s, (t)>, converges to a distribution vector i then 
the sequence of linear transformations of ke <P, (8, (t))>, 
converges to DB, (8,). In this case it may be reasonable to 
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choose the parameters of the lumped process by taking a 


n i A Pm = a 
convex combination of P.(s,(0)) and P.(s,). For example, 


A Ww > N 
(Gis Pjr a) = (1-0)P, (s, (0)) + OP, (S¡), 0586 < 1, 


where small values of 8 are appropriate for short-range 
planning and values of @ close to 1 are more appropriate for 
long-range planning. 
3. Constant External.Elows 
It was shown in Chapter III that under constant 
external flows, i.e., | 


f. (t) = f a l xw; nonnegative vector, 


1’ 


the- limiting value of the stock vector is 


Y 
| 
E] 

Hr 
e 

ve) 
Ww 
[ate 

Z 
H 


We define 


1 
g. = J f Bj? (4) 


a l xw, vector of inputs into each state in grade i in 
steady state. We assume 3: #0. Then, 

fu — 

S 


Mn s/s. 


Pr 


Il 


9,N,/(9,N,1) 


q HF 


de a 
E (5) 


I! 


( 





g;1 gil 
so with respect to S. we may without loss of generally 


assume that J; is a distribution vector. 


Let 


, EM calculated from (4) and (5)}. 
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That is, Hs is the set of limit points of the sequence 
A Pu 
<p, (s, (t))> under constant external flows. We show that 


: x 
(except for a special case) the set H; is a proper subset 


OF H.. 
1 
Let, 
AE 
N.. =J rowof N., a l] x w. vater, 
ij i Í i 
g.. = m element of g., a scalar 
ij ji’ r 
Tis = Nij l, a scalar. (See equation (4) of 


Chapter II.) From Chapter II we have that ig is the 
average time spent in grade i starting from state 


(i, j+1(i) - 1). Note that us E Let D; be a w. x Ww 
matrix having lti? ne) on its main diagonal and 
zeroes elsewhere. Then Dr 


i 


is defined. 
For any l x wW; nonnegative vector Gas let 


im 1 
g, = = (917411 $ ar: alx W; vector. (6) 
g.N.l 1 1 

Seas 

+ * a + e + 
Then it can be shown that g, 1s a distribution vector, and, 


Ss. = g¿N,/(9¿N;1) 


Jo. D. N (7) 


Let, 


II 


'S(w,) {all x Ww. distribution vectors}, 


m 
S (w;) {all 1 x Ws distribution vectors 
determined by (7) }. 

u 
Clearly, s(w,) is a subset of S(w.); we now establish condi- 


tions under which it is a proper subset. 
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Eguation (6) maps S(w,) into S(w;), and it can be 
shown that the mapping is onto. That is, for every EN in 
S(w;) there is a g; in S (w, ) such that equation (6) maps 
g; to g Thus, equation (7) may be viewed as a mapping 
from S(w;) into S(w,). We show, however, that the mapping 
in equation (7) may not be onto; i.e., there may exist 
distribution vectors in S(w;) which are not limiting stock 
distributions for any choice of constant external flows 
(recall that (4) gives g; as a function of the constant 
external flows). 

The W X W; matrix (D TTN) is a non-singular linear 
transformation from S (w; ) into Sm). It is well-known (see, 
for example, Halmos, 1956) that for any non-singular linear 
transformation T having determinant d and any measurable set 
E, with Lebesque measure m(E), m(E)/m(T ’E) = |d|. In the 
present context this can be written, 

m 
m(S, (w,)) 
m(S, (w,)) 


u -1 
= [det (D,"N,) |. 


But the matrix (D7 


N.) is nonnegative and its rows 
all sum to one (i.e., it is a stochastic matrix). So its 
eigenvalues are bounded in absolute value by +l. The deter- 


minant of a matrix equals the product of its eigenvalues, so, 


% 
m(S.(w.)) p 
er laet(D.n.) |<ı. 
m(S.(w.)) i i''— 
ly 
% 
Furthermore, m(S, (w;)) = m(S,(w;)) if and only if the magnitude 
of every eigenvalue of ie N.) is l. It can be shown (see 


Karlin,theorem 2.1, page 97), that this can only occur when 
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the non-zero elements of O: are restricted to the main 
diagonal. This would mean that a person entering grade i 

in state (i,j) could not make a transition to any other state 
in grade i. 


Z nm 
If |det (D;~ N.)|<1, then S. (w;) 1S a proper subset of 
S; (w.). The sets H; and Hy are linear transformations of 
n a, : 
S(w,) and S(w,) respectively, so |det (D, N.) [<1 implies that 
Hy is a proper subset of H,. 


When the set of states in grade i has the 0-1 visiting 
property we may assume without loss of generality that N, 1s 
upper triangular with ones on its main diagonal (see Chapter 
II, Section C). Then the determinant of N, is +l. The 


determinant of the diagonal matrix D; iS, 


W. 
i 
det D; = Ii Tea 
j=1 2 
Pee t..>1, so det D. = 1 if and only if all T..'s equal 1. 
ij- i ay 


When the 0-1 visiting property holds this requires that O: 
be a zero matrix. Thus, if the 0-1 visiting property holds 


and O; is not a zero matrix, then 
W. 

1 
( I Tia) < I. 
j=1 2 


OY 
m(S, (w,)) 
m(S, (w,) 
Furthermore any change in O; that increases the average time 
i nm 
spent in grade i, tijt will decrease the size of S, (w,) 
and thus decrease the size of HT. 


In summary, it has been shown that under constant 


external flows the vector of the fractions retained in grade, 
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promoted and lost from bue een converges to some point in 
HT which is in most cases of interest a proper subset of Hi. 
Thus, when the lumped process is to be used for long-range 
projections, choosing the parameter vector (9; » Pir a.) from 
HT would seem quite reasonable. 
4. Linear Growth of External Flows 

The case of linear growth of the external flows leads 
to results similar to those in the case of constant external 
flows. It was shown in Section C of Chapter III that under 
linear growth of external flows, £, (t) = tf. , the stock 
vector in građe i is asymptotically linear and 


s; (t) = tL, + Ci 


where 


I 


a = ES S) Ny 


C: 
a 


pee cgay ro EN ON, 


The limiting stock distribution vector is 


Lim ^% _ Kim = 

te j (t) = 430 S,(t)/s, (t)1 
_ Lim ; 7 
= ten (EL + C¿)/(tL; + C¿)1 
= L,/L,1. 

But, 
i 
L. = E TNA 
1 on ke Rie i 


Dan mn t 
So the limiting value of s; (t) is the same in the case of 
linear-growth external flows as it was in the case of 


constant external flows. Consequently the set of all possible 
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k 
limit points for the flow fractions, H; is the same under 


both constant and linear-growth external flows. 


C. AN EXAMPLE 
In this section we use an example to illustrate how the 
planner might use the results of Section B to combine states 


in a two-characteristic model. 





Let, 
0 2 0 0 7 0 0 0 
0 0 .5 0 0 4 0 0 
O; = a P; = t A. = e 
0 0 O .4 0 0 0 0 
0 0 0 0 0 0 0 .1 
Then, Ä 
Pi = [Q; 1, Pil, A,1] 
2 7 1 
5 .4 . 1 
4 Be 5 


> 


Clearly, the rows of P. are not identical, so the condition 
for lumpability is not satisfied. The rows of P, are the 
extreme points of Hay ahd H- is plotted in solid lines in 
Figure 2. Any choice of parameters for the combined process 
should be taken from this set H. 

Let the initial stock vector be, 


s, (0) 020770, 10, 0). 
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0.8 0.6 04 O02 


Figure 2. Plot of Points and Sets Used in 


in Combining States. 
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Then, 


s.(0) = s,(0)/s,(0)1 
= 022720270: 1,20, 
and, 
Av A l 
S; (0) P; = (0.43, 0.43, 0.14). 
That 158, 
™ 
qs (s,(0)) = 0.43, 
mn 
Pi (s; (0)) = 0343, 
S.(0)) = 0.14 
a; (s; ( )) = 0. 


The point (0.43, 0.43, 0.14) is denoted by a star in Figure 2. 
For a short-range projection (one or two periods) this point 
would be a good choice for the grade i parameters of the 
combined process. 

Next let us consider what choices of parameters would be 
appropriate for long-range planning under constant external 
flows. From equation (7) of Section B.2 we know that the 
distribution vector for the equilibrium stocks in grade i, 


y 
Sar must satisfy, 


vI a * -] 
= u eames 
* 
where J; is a distribution vector. In the present 
example, 

1 0.2 0.1 0.04 

0 1 0.5 082 
N, = i 

0 0 l 0.4 

0 0 0 1 


and the diagonal elements of D; are 1.34, 1.7, 1.4 and l. 
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Thus, 


„745 2350 5 — .030 
0 ‚590 295 05 

pit Nn. = 
a 0 0 ‚715 „285 
0 0 0 1 


In equilibrium under constant external flows the fractions 


remaining in grade, promoted, and leaving the system are 


given by S; ae and from the above we have, 
So P. = ao? NP. 
ri a ei 
755 50075. 030 22 y oa 
3 0 DO >. 2950. 115 5 4 
0 0 IVA. 285 .4 zal 5 
0 0 0 1 | y Hi 
J | 
2255 .590 -155 
„| »403 238 309 
zu: : 
. 286 . 100 .714 
0 - 100 200 


5 
Because gJ; can be any lxw, distribution vector, we see that 
the parameters of the combined process in equilibrium under 

A A i : 
constant external flows, S; P; , must be a convex combination 
of the rows of the above matrix. The set of all such convex 

. e e . * a a 

combinations has been previously designated Hey and it 1S 
plotted with dashed lines in Figure 2. For long-range 


planning the planner should restrict his choice of parameters 


x 
for the combined process to this set H; 
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Next let us assume that the planner estimates that in 
equilibrium the vector of the numbers entering the states 
in grade 1 would be approximately, 


(30, 50, 20, 0). 


Ji 
Then the equilibrium stocks would be 


Si = Ji Ni 


(30, 60, 48, 19.2), 


Ss = (101) eer esos. . 122). 
and, 

N, N 

Ss, P, = (.351, .329, .320). 


The latter point is denoted by a triangle in Figure 2. In 
this case a convex combination of the points denoted by * and 
A in Figure 2 would seem a reasonable choice of the parameters 
of the combined process, i.e., 
(q; Das a.) =O) sce, .o20 + (1.0) (243, .43, .14), 
where 0 < 0 < l. 
Small values of 8 are used for short-range planning, and larger 


values of 8 are used for long-range planning. 
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VII. AN APPLICATION TO RETRAINING PROBLEMS 


A. INTRODUCTION 
Consider an organization with people trained in various 


skills. Each person with a given skill belongs to a "skill 


group," which we will call "group" for simplicity. In this 


chapter we assume that the second characteristic in the state 
description is the group to which an individual belongs. We 
also assume that a person cannot belong to more than one group 
at a time. 

Group membership may be quite explicit as in the Case of 
the U.S. Marine Corps system of occupational fields or it can 
be implicit as in the case where group membership is determined 
by the number of years of formal education completed. In any 
case group membership defines a partition on the organization. 

In many organizations retention and promotion vary con- 
Siderably from group to group. Because of this it is often 
the case that people have to be retrained between groups in 
order to prevent surpluses and deficiencies of people in 
various skills. This is particularly true in the military 
enlisted personnel system. These retraining problems are the 
Subject of this chapter. 

We assume that there iS a nonnegative cost associated 
with the retraining of a person from se group to another. 

The meaning and the numerical value of a retraining cost is 
left to the planner. For example, a planner may express his 


opinion that people in group k are unsuited for retraining 
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into group m by assigning a prohibitively high cost to such 
retraining. 

Unlike the LOS or TIG models, in the retraining model we 
are quite interested in how many people change their second 
characteristic each period. We assume that assumptions AO, 
Al and A2 of Chapter II still hold. 

We are interested in long-range planning, so we begin 
with an equilibrium model, i.e., a model in which we assume 
that stocks and flows do not vary from one period to the 
next. The equilibrium model is very useful in determining 
achievable goals for an organization. Once determined, these 
goals can be used to judge short-range manpower policies. 

For example, use of the equilibrium model might show a long- 
range requirement for retraining into the sonarman group, 
and there might be a present Surplus of sonarmen. The goals 
from the equilibrium model would cause us in this case to 
question a short-range recommendation to convert sonarmen 
training facilities to some other use. 

In a retraining problem the planner would ideally like 
to specify the stocks and promotion rates for each grade/ 
group combination (state) and then minimize retraining costs. 
Unfortunately, this is not generally possible, and the planner 
must compromise or trade off between desired stocks, desired 
promotion rates and minimal retraining costs. A major 
purpose of our development of the retraining model will be 
to show the close interaction between stocks, SER rates 


and retraining costs. Consequently, we take a descriptive 
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rather than a prescriptive viewpoint of the retraining 
problem. The prescriptive approach would require the planner 
to specify a mathematical description of his preferences with 
respect to tradeoffs between stocks, promotion rates and 
retraining costs. Such a requirement is usually quite 
impracticable. 

In the following development of the retraining model the 
attrition rates are treated as known and fixed. This may not 
be entirely acceptable. Retraining can be used as a method 
of reducing attrition. It can be used as an inducement to 
people to remain in the organization. This problem has not 
been treated here. 

One purpose of the rasen analysis of the retrain- 
ing problem 1s to form the basis for an interactive computer 
program. Such a program would ET the planner in finding 
acceptable and feasible combinations of stocks, promotion 
rates and retraining costs. A device to be used in the 
interactive program, which is used in the mathematical 
analysis below, is proportionate control of stocks and 
promotion rates. For example, for a given grade the planner 
1s required to specify the proportionate promotion rates for 
the various groups in that grade. This device may appear 
artificial; however it has great practical advantages. The 
U.S. Department of Defense is considering a promotion policy 
in which promotion rates within any grade must be equal for 
all groups. It is not our intention to join the debate over 
the efficacy of such a policy, but our resuits are quite 


useful in investigating the implications of such a policy. 
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B. DEFINITIONS AND THE BALANCE EQUATION 

The organization 1S partitioned into states according to 
grade and group. It is assumed that transitions between 
states can be modeled by stationary fractional flows. The 
organization is assumed to be in equilibrium; stocks and 
flows do not change from one period to the next. One-period 
transitions from any grade are restricted = the same grade, 
the next higher grade or out of the system (see assumption Al). 

There are K groups indexed by k=1,°°-,K and n grades 
indexed by i=1,***+,n. State (i,k) corresponds to grade i 


and group k. We define 


Si, = number of people in grade 1 and group k, 
S; = (Siqr Sig) i a l x K vector, 
Liz = number of people who enter the system in state 


(i,k) each period, 


f. = (f, 


i 311 ct 


ig)! a l x K vector, 
a., = fraction of those in state (i,k) at the end of one 


period who leave the system during the next period, 


a. = fa. a K x l vector, 


4 Eva. 


ix! r 


= fraction of those in state (i,k) at the end of one 
period who are still in grade i (in any group) at 


the end of the next period, 


Dis — fraction of those in state (i,k) at the end of one 
period who are promoted to grade i+1 during the 
next period. 

Py = [Pi] tt Pig] a K x 1l vector. 


104 





From assumption Al we have 

MEME p ik T 

Nonnegative proportionality constants for the promotion 
mates, la, :+i=1,***,n, k=1,°°°,K}, are specified so that in 
each grade i, 

Pik = PE ye Kelas ok, (1) 
where Pi is the maximum promotion rate over all groups in 
grade i. 

Let, 
a ee, 


equal to l. Thus, 


J], a Kx 1 vector with maximum element 


Py = PU; = 


and we see that whatever the promotion rates for the various 
groups in grade i may be, they will be proportioned according 
to the components of Qs. 

We begin by considering all K groups in some grade i. 


The number of people entering grade i during any period is: 


K K 
No. into grade i = l fik + 2 5 A ED; 2 Eh 
k=] k= 
E p i (3a) 
The number of people leaving grade during any period is: 
} } 
No. out of grade i = ae. + S..P: 
k=1 ik ik En Ik 2k 
= s,a, + S,P,; 
= s,{a. + p.). (3b) 
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Under the assumption that the system is in equilibrium, the 
numbers entering and leaving a grade each period must be 


equal. Thus 


f.l + Si_1Pi-1 = S; (a; + P;)- (4) 


The above equation will be referred to as the balance 
equation for grade i. 
C. PARAMETRIC CALCULATIONS FOR PROMOTION RATES 

For specified stocks we use the proportionality constants 


for the promotion rates to solve for the promotion rates one 


grade at a time starting with grade 1. 


ee er 


I 
Nn 
y 


(Recall the p.'s are scalars.) 


1 
Thus, 
a 3 fyl gl 
a 
1 5% 
and 
* 
E PP ° 


For gradesiii=2z,.+. 1: 


A laa) 
= + p.s.0 
NS E oe 
Thus, 
oo o a S a 
i SA. 
IE 
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and, 


aes 


Note that when the stocks are specified we can solve for 
all the promotion rates without explicitly considering the 
retraining flows between groups. 

We note that: 

1) f.l is the number of people who enter grade i each 
period from outside the system, 


2) is the number of people promoted into grade i 


Si-1Pi-1 
(from grade i-1) each period, 

3) S.a; is the number of people who leave the system from 
grade i each period. Thus p: ave and only af the total 
number entering grade 1 each period is no smaller than the 
number leaving the system from grade i each period. 

Feasible promotion rates must satisfy OPS | - as, - 
The values of Pix computed above from the balance equations 
may not satisfy these constraints. It may also be the case 
that the promotion rates computed above are unacceptable for 
practical reasons. We next consider how one may trade off 


between stocks and promotion rates to obtain a satisfactory 


feasible set of rates. 


D., PARAMETRIC CALCULATIONS FOR STOCKS 

Let us suppose that the specified stocks yield infeasible 
promotion rates in grade i, e.g., p: í o. We might then 
approach the problem from the opposite direction in grade 


i by specifying the promotion rates and solving the balance 


equations for the stocks. The difficulty is that the solution 
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to such a problem is not unique. We will discuss two 
parametric methods that lead to unique solutions for the 
stocks in grade i when the values of the promotion rates 
in grade i are specified. 

In the context of interactive programming the parametric 
calculation of stocks is used to make adjustments when the 
initially specified stocks have led to infeasible or unaccept- 
able promotion rates. In Method 1 a specified proportionality 
between stocks in the various groups is maintained. In Method 
2 we maintain a specified proportionality between the 
deviations of the stocks from their desired levels. 

1. Method 1 

In this method we specify nonnegative proportionality 
constants for the stocks in the various groups in grade i, 


K 
L e Legs shen, 


{y.,:k=1,---,K} where i 


k 
— * a 

ap eee © eo) 

where s; is the total number of people in grade i. 


Bet, 


Ver (Vir Yi) a 1 x K vector. 


E 
s. = S¿Y; + (7) 


Let the promotion rate vector for grade 1, Pir be specified. 
The balance equation (4) for grade i is 


O EN 


* 
s.y,(a, + Pi) . 
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Thus, 


.* = a] | ig 
1 Y; (a; + py) 
and, 
E 
Be «(Oa ak? 


To interpret these results note that, 


* 
Sige? Ps) 


Ye 


iai i Py) 


I! 


fraction of those in grade i who 
leave grade 1 (by attrition or 


promotion) each period. 
2. Methed 2 
In this method we specify proportionality constants 
Mor the changes in the stocks in grade i, (8,1; k=1,. Pe} 
We suppose that the desired stocks in grade 1, denoted by 
tsik’ 


rates, and require that the stocks satisfy not only the 


h=1,°-°°,K} lead to infeasible or unacceptable promotion 


balance equation but also the parametric constraints, 


S., = (l + C; 6,,.)8 


ik LK 


where C; is a scalar to be determined. 


Let, 
Os = (Osage tts Oude a 1l x K vector, 
Ss. = (Seyret t tr Sands a l x K vector, 


aná define an operator A such that EN x EN FEN and 


“a IN 


ô. @s 


i cena,’ $ 


ixsix): (See Appendix A.) 
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We then have 


» = * + ` * S + + 
Si Si “i 1 1 


Using Method 2 parameterization the balance equation 


(4) for grade i is 


SEA > Ss la] + Py) 
= (s tie 6.@5,) (a; añ pj? . 
Thus, 
ee PEt PD 
i eS A 
and, 
Sik = (1 T GT 8.4) Sik . 


One advantage of Method 2 is that the stock in some 
group k can be held constant at the desired level Sag by 
assigning Six a value of zero. 

A significant difference between the two method of 
parameterizing the stocks is that Method 1 always yields 
nonnegative stocks while Method 2 may not. In the initial 
stages of an investigation of retraining policies, Method 1 
would seem more practical. After feasible and not-too- 
unacceptable policies have been derived, tnen Method 2 could 
be used to advantage. Only Method 1 will be used to para- 


meterize stocks in subsequent sections. 


E. RETRAINING FLOWS 
Once we have arrived at a set of stocks and promotion 
rates that are acceptable, we may compute the required 


retraining flows. Let, 
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= number of people retrained out of (>0) or into 


r. 
1k 
(<0) state (i,k) each period 
me PETER UE a l x K vector. 


We derive an expression for the retraining flow r., by 


IR 


considering the numbers of people that enter and leave 
state (i,k) each period in the absence of retraining: 


No. into state (i,k) = f,, + Ai 


No. out of state (1,k) = S.1 (aj, + Ps y) > 


Under the assumption that the system is in equilibrium, the 
flows into and out of state (1,k) must be equal; we use the 


retraining flow r,, to bring about this equality: 


1k 


r.. = (f., + 


EK ik ee a 


A, (3) 


The balance equation (4) ensures that 


I = 0, 


1 


I DIN 


ik 
k 
i.e., everyone retrained cut of one group is retrained into 


another group. 


F. THE RETRAINING TRANSPORTATION PROBLEM 

We assume there is a known cost c, (k,m) for retraining a 
person in grade i from group k to group m. We assume that 
all costs, including c,(k,k), are strictly positive. 

For any set of stocks and promotion rates in grade i that 
Satisfy the balance equation, we may compute the retraining 
flows, tr. J, and treat these as supplies of people for 
retraining (rs). pO) Mandedemeumas for retrained people Ca 
The balance equation ensures that total supply equals total 


demand. 
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Let, 


x. (k,m) = number of people retrained from state (1,k) 
to state (i,m) each period. 
In order to match supplies and demands at minimal cost, we 


must solve the following linear program. 


K K 
Pl] min ) ) Cc, (k,m) x, (k,m) 
h=1 m=1 
K 
ST ey - x, (m,k)) = Een Bl > SR 


x. (k,m) mp Orca kK ee PK. 


The equality constraints in Pl ensure that for each state 
(i,k) the net number of people retrained out of or into state 
(i,k) matches the supply of or demand for retrained people 
For that state. 

We note that, 

K K 

) (x, (k,m) - x,(m,k)) =  ) (x,(k,m) - x,(m,k)) 
m=1 m=1 
mk 


is an identity. That is, x; (k,k) does not appear in the 


equality constraints of Pl. The nonnegativity constraints 
do reguire x. (k,k) 2 0. Because we assume c. (k,k) > 0, the 
optimal solution to Pl will always have x. (k,k) = 0. 


The dual of Pl is denoted DIL: 


K 
DI] max ) roy Vay 


k=1 
ST Vik T is = c; (km); EE ER. 
vn WUNRESTRICTED, k=1,* ++ ,K. 
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It will be convenient to define the following vectors: 
x. = [x; (1,1),°++,x; (1,K), x; (2,1) ,°°° ,X:(K,1),***,x,(K,K)], 


a K? x 1 vector, 


E = aj? alo a K x 1 vector. 
The linear program Pl is quite similar to the classical 
transportation problem. It becomes a transportation problem 


if we make the following assumption. 
Assumption A3. Bor äanyahyj,mwell,''’,K), 
c; (k,m) < c; (k,j) + c; (j.m). 


The practical implication of this assumption is that it 
must be cheaper to retrain a person from group k to group m 
than it is to retrain one person from group k to group J 
and retrain another person from group jJ to group m. 

The theoretical implication of this assumption is that 
the optimal values of some readily identified variables in 


Pl must be zero. 


Let, 
A = e 
Bester. > 0), 
Ki = es Pix O 


Consider the case in which a the optimal solution 
to DI is o and the optimal solution to Pl is ae 


Suppose that xo (k,3) > 0. By complementary slackness it 
must then be true that, 


0 oe 
Vip T Vi =: c.(k,j).- (10) 
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But ER so there must be some m such that x5 (Jem) 2495 
This implies by complementary slackness .that, 


O om i 
Vij AE c; (Jm). (11) 


Adding (10) and (11) and using assumption A3, we have 


O O ; ; 
Vik S Vim = c; (k,J) $ c; (J,m) > c; (k,m), 


implying that the optimal solution to Dl, os is not feas- 
ible, a contradiction. Consequently, it ano be true 
that Ze > 0 when jek,. By a sımilar argument, we can 
show that it cannot be true that x: (j,m) > 0 when jeK, . 

Thus assumption A3 implies that in the optimal primal 
solution, if xo (km) > 0, then k €K}, meK, . Removing the 
variables which must be zero in any optimal solution from 
Pl leads to the primal and dual equivalents of PJ] and Dl 
under assumption A3: 


PRTP] min y y c,(K,m) x, (K,m) 


kek’ meK. 
1 $ 
ae u + 
ST u x, (k,ım) = r,,, keK, 
Ss 
bt - x, (k,m) =r, 7, meK,; 
i 
X. (km) = Oe 
K 
DRTP ] max pe Fix Vik 
- + _ 
ST Vik - Vig < Cc; (k,m); KEK., mEK 
V;y UNRESTRICTED. 
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The abbreviations PRTP and DRTP denote the primal and dual 
retraining transportation probiems respectively. The 
programs PRTP and DRTP are in the form of the classical 
transportation problem. 

We should pause to note some of the assumptions inherent 
in modeling retraining costs in this rather simple manner. 

l) There is no provision for set up costs. 

2) Each grade is treated as a separate problem. 

3) Marginal retraining costs are assumed constant with 
respect to the number retrained. 
The computational tractability of the simple transportation 
model is no small consideration, and one suspects that the 
model is adequate for the purposes at hand, viz., long-range 
Plannung. 


Let the optimal value and the optimal solution to DRTP 


O 
be denoted a; and vs = re ‘,Vix] respectively. Then, 
O y O 
d. = I eee. (12) 
l k=l TRIR 


We note that the optimal solution to DRTP is unique only to 


an additive constant. 


G. RELATION OF RETRAINING COSTS TO STOCKS AND PROMOTION RATES 
In this section we assume that the stocks and promotion 
rates in grade i-l have been determined, and we consider how 
our choice of the stocks and promotion rates in grade i will 
affect the optimal retraining costs in that grade. 
We will parameterize both the stocks and the promotion 


rates in grade i using Method 1 for the parametric calculation 
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of the stocks. Thus, we must specify two sets of propor- 
tionality constants (see equations (2) and (7)): 


1) la... k=1,--+ + W} for the’ prometion rates, 


2) Yıy Kel, 2 ER for the stocks. 


We consider the case in which the total number of people 
entering grade i each period is known and fixed: 


£; Sy @ Pi = g;, a known l x K vector 


ea f Si-1Pi-1 = g.i, a scalar. 


The stocks and promotion rates for the various groups in 
* * 
grade i are determined by the choice of Ss, and P; : 


= 5 


The balance equation (4) places a constraint on the choice 
x 


* . 
of S; and Pp, - We consider si as the independent variable, 


so the balance equation, 


g;1 = s, (a, + py? 


_ * * 
en zz 
Xx 
leads to the relation of the dependent variable P; to the 


* 
independent varıable S;: 


This relation is sketched on the following page. 
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Figure 2. Sketch of s and a 


The curve is asymptotic to the lines s; = 0 and 


> Pa 
Mie Tea a 


We must have pP; Poren heey (13) is equivalent to 


a g;l, i.e., the total number leaving the system from 
grade i each period must be no greater than the total number 
entering grade i each period. It is only when the above 


inequality is strict that we may (indeed must) promote people 


Out of grade i. 
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: * * 

Let us suppose that the couple (S., P;) satisfies (13). 

We then use the retraining flow equation, 
E e. * 
ee ewe ee A eee (9) 
to determine the RHS of the retraining transportation problem. 
The value of r., depends on s* and po. Because po is 
1k 1 1 l 

uniquely determined by z from (13), we may consider rs, a 
function of sE Substituting the expression for pa from 


(13) into the equation for r., we have, 


x x g;l - sįYjāj 
51 (S;) = Ores SY (Bi + ee) 
“aed 


After some manipulation this leads to, 


Sky. e EA EEE — Ei 
ash et An 


g. 
E | 
A (s;) = g;1 a 
5 mi 








ee (14) 


We assume the vector of group-to-group retraining costs, 


C. 


i! is fixed and known, so the optimal value of the retrain- 


ing transportation problem is a function of the vector of 
retraining flows, r. But we have shown that for fixed flow 
into grade i, 9;1 the retraining flows are a function of zw 
so the optimal value of the retraining transportation problem 
is a function of S We show this by rewriting the primal 


and dual of the retraining transportation problem. 


PRTP} mind, = } ) c, (k,m)x, (k,m) 


kek’ mek. 
i i 


u * + 
ST ) Bl) = Pix (Sj)r KEK; 


mEK. 
1 
y y * > 
ao = -Iri mti)? mek; 
keK. 
1 


x, (k,m) > 0. 
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K 
+ * 
DRTP] max d;(s;) l Yig(Si) Vik 
k=1 
ST v., - < C. (k,m); kek} eK. 
IE m we i ae 
Vinee a UNRESTRICTED. 


The question that we address in this section is how the 
optimal retraining costs vary with respect to the total 


stocks s! . 
1 
For some specified values of si let aĵ (sf) be the optimal 


value of the retraining costs and let v9 (s7) be the optimal 
solution to the dual retraining transportation problem. 


We then have, 





O * 
cer . 3 Seh) 
ds! s! A ie wa IRI 
K S Ive (s* 
k=] oS. oS. 


From (14) we have immediately, 











* 
art ae ey irtir _ Yirdik 
* a a een = a 
ee Il TO. Y;a; 
a 
The question then is, how does the dual optimal solution 
or o * . k : i * 
v;(s;) vary with S;- We note in the dual that varying S; 
ees * 
aoe not alter the feasibility of AE We note in the 
* 
primal that varying S; varies the RHS of the program, so the 
primal optimal solution must vary with S;- But suppose that 


the primal basis that was optimal at = is still feasible at 


* . a a + è 
S; t e for le | sufficiently small. Then this primal basis 


IES 





is feasible, it satisfies the complementary slackness 

: : : O * O * : ‘ E 
condition with respect to v.(s.), and v; (s;) is feasible in 

* a . 
the dual, so vi (s;) is still the dual optimal solution. In 
summary, the dual optimal solution, ver) does not change 
5 * 

with Ss: so long as the primal basis which was optimal at S; 
remains feasible. (See Dantzig, 1963.) 

Consequently, if for some €>0 changing the stocks from 

* 


* f a 
S; to Si + € does not cause a primal basis change, then, 


ov, (8) 





= 0, 
Js" 
1 
and 
3af (s7) Y. Y. a 
1 ik ik ik ik 
AS I (s; A a) (15) 
Is; De an en 
i 
We note that 
s'y.a S.a 
1 i? 1 1.1 
1) SS * = = 
S S.l 
l 1 


fraction of those in grade i who leave 


the system each period 


* 
E o a AE 
y.a. * S.a. 
i i S. Y.a. Ii 
Ii i 
= fraction of those leaving system from 
grade i who leave from state (i,k) 
3) a i ik _ PikPik 
Yii s* "0 S;P; 
iViPi“j 


fraction of those promoted from grade 


i who are promoted from state (i,k). 
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I epeejal Case 1 -- Costs Constant 


Special case 1 is that in which. for some positive 
comstant cC, 
Gik = Caig’ KERS K., 


In this special case, 


SS 
= -k 
A AR 
li eee ey Kae 
Yırkır Yiktik ; 
Tz f 
Yii 1 


ES g.1(— a 
1k 1 9.1 Y;a; 
1 
de * 
dd. (s;) 
A is 
dS; 


Thus, when promotion rates are proportional to attrition 
rates, the retraining flows do not depend on the total 
stocks T Consequently, the optimal retraining costs do 
not depend on sa 

The sensitivity of retraining costs to the total 
stocks depends in some sense on how promotion and attrition 
rates depart from the foregoing special case. 
| 2. Special Case 2 -- Costs Increase with Stocks 

A second special case is that in which for each 
ke{l,-°+,K}, either, 
Desk, Miki 


1) re 
g 51. 





ars ath. a an d 
Yi de Yi 38 
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We note that, 


ik = fraction of those entering grade i who enter in 

g9;1 group k. 
Thus, condition 1) above indicates that the fraction of grade 
i "entrants" entering group k is smaller than the fraction of 
grade i "promotees" promoted from group k which in turn is 
smaller than the fraction of grade i "leavers" leaving from 
group k. From equation (14), condition 1) implies that both 
Bis.) and its partial derivatıve with respect to s; are non- 
positive. That is, there is a demand for retrained people in 
group k of grade i, and as the total stocks in grade i 
increase so does this demand for retrained people in group 
k. 

Condition 2) is simply condition 1) with the inequalities 
reversed. Condition 2) implies that both rx (s;) and its 
partial derivative with respect to = are nonnegative. So 
there is an excess of people in group k of grade i, and as 
the total stocks in grade i increase so does this excess of 
people in group k. 

In this special case increasing total stocks increases the 
Supplies in the groups that have people available for retrain- 
ing and increases the requirements in the groups that have 
a need for retrained people. Consequently, increasing total 


Kr N: 
stocks, S;, increases the optimal retraining costs. 
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3. Special Case 3 -- Costs Decrease with Stocks 
A third special case is that in which for some values 
of si we have for each kell,---,K), eveher, 


Yırfık y Yixtik 


x 
1) r..(s.) > O and 
1 Yi T Yyy 


t 
or, 


Yırfır _ Yiktik 


E 
sica E sc 
Yi 1 Yi 1 


2) re) < Olana 
Condition 1) above indicates that group k has people available 
for retraining, and the fraction of grade i "promotees" promoted 
from group k is smaller than the fraction of grade i "leavers" 
leaving from group k. If the total stocks, En are increased 
then the number leaving the system from group k increases in 
direct proportion to one However, increasing si causes the 
overall promotion rate p to decrease, so the number promoted 
from group k does not increase as rapidly as si. Under 
condition 1) we then have a decrease in the number to be 
retrained when s; increases. 

Condition 2) is simply conđition 1) with the inequal- 
ities reversed. Under condition 2) there is a requirement for 
retrained people, but this requirement decreases as = 
increases. 

In this special case increasing total stocks decreases 
the supplies in the groups that have people available for 
retraining and decreases the requirements in the groups that 
have a need for retrained people. Consequently, increasing 


* k . . 
total stocks, Sis decreases the optimal retraining costs. 
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The rate of change of retraining costs with respect 


to promotion rates may be computed from 





* * 
an 

* * 
op; Y; (a; ar Pj) 


anad; 











H? al 1 
ap} as; ap; 


H. TWO-GRADE OPTIMIZATION OF RETRAINING COSTS 

In this section we develop a technique for jointly 
varying the stocks and promotion rates in two grades, i and 
i+l, in such a way that the stock and promotion rates in 
grades below grade i and above grade i+l need not be changed. 
We then show how the retraining costs in grades 1 and itl 
respond to these stock and promotion rate changes. 

Recall the balance equations (4) for grades i, itl, and 
1+2: 


f et * * * * 
a 8:1 S;_)P3-) = s.Y. (a.tp.a.) = s; (Y;a;) + s,p; (Y;4,) 


2: o: Le ELL 
i+]1) f. .l "Pe == es 
Os A +) 
= * * 


al eiT a aan Gain S A Pi? : 


è * s a 
1 and Pi-1 as fixed vectors. If S; 1S varied, 


then BN must also change in order to satisfy 1). Furthermore, 


We consider Si_ 


Kock . * . . 
the product s.P; cannot remaln constant as S; 1S varied. 


: : A P , Ro, a E f 
This means that satisfying i+l) when S; 1s varied will require 


* 
or P. 


Xx o, 
i+] OY both. Suppose that as s, is 


a change in s* 
E el 
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varied we change the value of 3, so that i+l) is satisfied 
* 


Es] is held constant. This is possible 


* 
and the produce S;47P 
* 
and the values of Sia) 


constant equation i+2) is satisfied without 


and BA are uniquely determined. By 


‘ * * 
holding Si41Pi+1 


changing stocks in grade it+2.° Thus we have restricted the 
effects of varying s; to grades i and itl; all other grades 
are unaffected. 

A matter of interest is how the retraining costs vary as 
a consequence of changing stocks by the foregoing technique. 
We have shown in the previous section how retraining costs 
in grade i respond to changes in stocks when the input flow 
is fixed. We next consider how retraining costs in grade 
i+l respond to changes in s; when the promotion flow from 
grade i+l, a is held constant. 

We use three equations: 


1) Balance equation 


= TR * * 
1 SPA SS Praia? (40) 


f 
1 


itl 


2) Retraining flow equation 
* kọ 
Tit, k ~ Fita,k * SiPivix*ix 
A e 
7 Si41%i+1,k iy Pid G41, (90) 
3) Fixed promotion flow constraint 
Si+1Pj+] 


Taking the pertial derivative of the balance equation with 


= constant. 


* 
respect to Si yields, 


ds”, Y.a. 
i i L 


93; ir 





Menpar tial derivative of total stocks in gra@e i+1l wath 


respect to total stocks in grade 1 is negative when Sail: i 


x x i i i ; 
and Si }]Pj4]1 are held constant. This may be explained in 
terms of the promotion rate in grade i. Decreasing pi 
causes an increase in si but a decrease in api This ün 


i * i i 
turn causes a decrease in S:4]1° Takıng the partial deriva- 


tive of the retraining flow equation then leads to, 


Sek 


* 
IS. 
i 


Yi+l,k Sk 2 Yik fi 


1k 
Bee) 


= y.a. ( 


1 eae 


pes 


We then have, 





O 

dil KE Yi+1,k %i+1,k  Yik “ix 

* 7 178, } Yi+l,k'Y a A ) 
IS; k=1 2 i+l i+l r 
and, 

O O 

E a) Se ane N Yi 

* we } man va. + E y.a 
95; k=1 1 ai $ ii 
y. a. 
i it] i+ 


This equation provides a rather practical means of guiding 
the planner in his search for an acceptable combination of 
stocks, promotion rates and retraining costs. The scalar 
coefficients of the optimal dual variables in (16) are 
readily computed, and they can be stored in a 3 x K matrix. 
For the value of s; specified by the planner the optimal dual 
variables for the corresponding ihre transportation 
problem are found and (16) is computed using 3K multiplica- 
tions and additions. The sign of the partial derivative ın 


(16) indicates to the planner how he might reduce retraining 
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costs. Whether the planner chooses to make the indicated 
change in total stocks to reduce costs will depend on how he 
perceives his constraints on stocks and promotion rates. At 
any rate he may change the stocks in grades i and i+tl so as 
to decrease retraining costs and confine the effects of the 


stock changes to grades i and itl. 
A special case of some practical interest is that in 
which retraining costs, 
1) are the same in grades i and itl, i.e., 
c; (k,m) = C} (Km), 
and, 
2) depend only on the group retrained into, i.e., 
c; (k,m) = c, (m). 
In this special case any primal feasible solution to the 
retraining transportation problem is optimal, and an optimal 
solution to the dual is, 


oo eK 
1k 1 
= =c. (k) if keK. . 
JE 1 


One then has, 


O O 
SN a A 
* De; } = c. (k) aE Lea) : 
os. kek. EU i itl i+l 
i i 
Roughly speaking, the above equation compares attrition rates 
in grades i and i+l weighted by retraining costs, and indi- 


cates that costs will be reduced by reducing stocks in the 


grade that has the greater weighted attrition rate. 


I. THE RELATION OF TRAINING COSTS TO TOTAL FLOWS 
In Sections G and H we studied the behavior of the optimal 


En: Fe 
refraınıng costs when the total stock in grade 1, Sir 18 the 
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independent variable. This approach has definite practical 
appeal because =" is likely to be of importance to the planner. 
In this section we shall consider the behavior of the optimal 
retraining costs when there are two independent variables: 
the total number of people entering grade i each period and 
the total number of people promoted out of grade i each 
period. The two scalar variables will be referred to as the 
"total flow into" and the "total flow out of" grade i. It 
should be noted, that the "total flow out of" grade i does 
not include the number of people that leave the system from 
grade i each period. 


We begin with an assumption about the external flows. 


Assumption. For grades i=2,:***»,n the external flows are 
zero vectors, 1.c., f; = 0, i=2,""",n. There is a known 
u 
distribution vector fi such that the external flow for grade 
% 
l is always a scalar multiple of fi: A £, always 
£ y 
satisfies — = f.. 
= 1 
f,1 


The practical implications of this assumption are that 
hiring is restricted to grade l, and the vector of people 
hired in grade 1 has a known distribution. The theoretical 
implications of this assumption are that the total flow out 
of grade i is the total flow into grade itl, and the distri- 
bution of any total flow is specified by £, or (Y.ea.)/Y,0; a 


(The latter point will be explained.) 


128 





Let us define a 1 x K vector IF (9¡1 1 0 19h? by 
g, = £, when 1=1 
= S;_18P;_11 when 1=2,**-*,n. 


We may also write g, as the product of a total flow and a 


distribution vector, 


— L $ 
(£,1) fi when i=l 


Sur 
CM y, 03) e | when i=2,-**,n. 
ee a 1 
where, 
£,1 = total flow into grade 1, (a scalar) 
se 5 Yi-1%;-] 7 total flow into grade 1, 1=2,°*** 7m 
mv 


(a scalar), and the vectors f, and Y ¿100 _1/Y 4-1%.-1 are 
distribution vectors. Thus the vector of the numbers of 
people entering the various groups in grade i, Sas is always 
equal to a scalar times a known distribution vector. 
By our definitions of “total flow into” and "total flow 
out of" any grade we have, 
g.1 = total flow into građe 1, 1=1,***,n 

= total flow out of grade i-l, 1=2,°°°,n. 
Note that the total flow out of grade n must be zero, so we 
define Gr = 0. 


+1 


Notation is simplified somewhat by defining, 


I; = g;l, a scalar, 


0 l, a scalar. 


AS 


the mnenomic I for "in" and 0 for "ae" is intended. 
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We define the "flow plane" for grade i as the non- 


E with points having coordinates (1,,0,). 


negative quadrant of E 
That is, the total flow into grade i, I,, and the total flow 
out of grade i, O., define a point in the flow plane for 
grade i. 


Lemma 1. For any point (Ijs 05) in the flow plane for 


grade i the values of Ss; and En that satisfy the balance 





equation are unique. In particular, 
LI. MOR 
wo Ue 1 
ee (17) 
ii 
Vowels O. O. 
* ie i u i 
Pi man =a * i (18) 
i i i i Seno. 
ena a. 


Proof. Recall that the balance equation (4) is, 


wer, 12, + Py? 


But under the assumption on external flows, 


Eee | Pye Spe 


so the balance equation can be written, 


i = .a. + a 
i SS EA 
But, 
S = s*p*y.qQ 
iPi iPiYi*i 
Zi + 
and, 
S = s* a 
wi ivi 1 
Thus, 
Ts a, + 
i ea i OF 
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Rearranging terms we have, 


x _ =- 
S= (I, DEN -> 


The total flow out of grade i is 


0. = anp: 
1 iPi 
= s* * a 
a PUGS v 
SO, 
x a 
Pj k 
Suede 
LI 


Vaak 0: 
(A 
vera" 
ii i i 





One may verify that substituting the above expressions for 


* 


S; and pP; into the balance equation reduces it to an 


identity. |] 
Lemma 2. For any point (I.s 0.) in the flow plane for 

grade 1 the vector of retraining flows r; = (X, 0 1 XL 

is well defined, and the retraining flow for group k in 


grade i, Leyes is given by 





ee o o n dKTik, x; (ik ik h ViKCik, 
» ° Ff . — . . . | Se = ay e 
iIk i i ik i va 1 y,a,; ‚Ya; 
Proof. From Section G, equation (14) gives ri, as a function 
of s* j 
1 
_ 9:1 Y. Oey. Y: O Y., A. 
Erlen) = le = ik ik, + s'y.a. ik = _ik ik, p 
Ya. A yog. Y.a. 
g;l 11 ee A: 1i 


From Lemma 1 we have 


1,21: 





Thus, 




















g. ade Vio sa 
T ik Vik "ik _ iktik _ Yirdik 
AA Det ai Ta 
AL LL LEL AL 
e o (ik ik _ Vixtik, 
Jik i* y.a. i “are. Yas °° 
en] LL I 


We must establish that Jik is a well defined function of I,- 


Let ex be a K x 1 vector having all zero components except 


for a l in its gen component. Let 


I 


% N 


Yaiza CPU 
po ld el when A 
Y.: Q. 
i-l] "1-1 


From the introductory discussion the distribution vector 


AY b 
g; is known. 


Eien 9. = (a = Te 
oo Si, SU 
= (g,1)g,e 
= Dee: | C] 


Lemmas l] and 2 demonstrate that total stock Se , overall 
promotion rate p and the retraining flows r; are uniquely 
determined by the total flows I; and 0. - It follows that 
the optimal retraining cost in grade i is also uniquely 
determined by these flows. 

The following notation is useful in the proof of theorem 
3 and in subsequent analysis: 


1) e 


V pw 
Jik 7 


where 


ik 


nm 
g. 
oe and e, are as defined in the proof of Lemma 2, 
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DA IA 
ik ik Y;a; 


Mii  Yik“ik 


DOE = 
aeo E: Yii 


+ 


We note that 9. 


er Diz and Biz are scalars; their values are 


r 


known and they do not depend on I. or 0.. 


We then have 








Va Vitaliy yY ani 

ooa Yik ikĉik Vat 

Bea on) ya ee re 
EL l 1 LoL 


Note that the retraining flows in grade i are linear functions 
of the total flows into and out of grade i. 


For any point (I., 0.) in the flow plane (excluding the 


Origin) define the ray through (Tis 0.) as R(i., Oir E. , 
R(I,, 0.) = {(x,y): for some c>0, x=cI;, y = c0,}. 
Theorem 3. For any point (I., 0,) in the flow plane for 


grade i (excluding the origin), 
1) the dual optimal solution to the retraining transportation 
problem, vo is the same for all points in R(I.,0,), 
2) the value of the optimal retraining cost for any point 
in R(I;,0;) is 
O _ O 
a; (cI;,, c0;) cd; (TI; 0.) (e>0):, 


3) the value of pi is constant in R(T, ,0;). 


Proof. We begin by writing the dual retraining transportation 
problem with the retraining flows as functions of the total 


flows into and out of grade li. 


1.38 





a 


DRTP] max d,(1,, 0,) = Fin Ir 0,) Vix 
ST - <c.(k,m); keK. K. 
E IS 
Vik’ Yim UNRESTRICTED. 


From Lemma 2 and equation (19), 


ha B 
mee O bik Ti t Pipi 


The objective function in DRTP may then be written, 


K 
d,(I,, 0,) = L 


fa. Tu 
1 k 1k 


1 i F Bik Onay 


ik 
Let Vi be a solution to DRTP at (I., 0.), L.€., 


K 
L (bip Ii + Buy 05) VG, - 


T OOT 
yi 


At any other point in R(I;s 0;) the cbjective function in 


DRIP is, 


k 
d,(el,, c0;) = L 


= cd, (I; , o) 


It is well-known that 
a linear program by a 


1) does not change 


multiplying the objective function of 
positive constant c, 


the value of the variables in the 


optimal solution, 


2) 


by the factor c. 


changes the optimal value of the objective function 
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Thus we have proven parts 1) and 2) of Theorem 3. 
To prove part 3) we note that from Lemma 1 the value of 


* a ee: m 
P; at any point in R(I,, 0.) ls, 





= MB a 
Pi = UN E where c>0, 
Yii cI. - c0. 
i i 
- eg 12, 
mo.’ m. -oo * 
ii i i 
h 1 fp. i tant in R(I., 0 
i.e., the value of p, is constant in ¡o 0;). = 


The flow plane for grade i is sketched in Figure 3. 
The constant-value curves for si and Be are depicted by dashed 
and solid lines respectively. Note that the ray on which 
I; = 0; corresponds to p; = © and s = 0. The ray on 
which 0; = 0 corresponds to Pp; = 0. 
We see from Theorem 3 that the optimal retraining cost 
is linear on any ray in the flow plane. Next, we consider 


the character of the optimal retraining cost in the region 


bounded by two rays, i.e., a cone in the flow plane. 


Lemma 4. If v3 is the dual optimal solution to the retraining 
transportation problem at two points in the flow plane, 
(Ii, 0; ) and (15, 05) , then vo is the dual optimal solution 


at any point (I: 0.) Seneca de for 0 < 0 < 1, 
= 1 t A u 11 
(I., 0.) as 9 (I; r 0.) + (1 Oo) (Is, 05) 


Proof. If vo solves the dual retraining transportation 


pull (DRTE) at(l:, 0°) and (I' , 0” ),, then, 
i i í 1 


1) vo is a feasible solution to DRTP everywhere in the 


flow plane for grade i, 
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Figure 3. Constant-Value Curves for s; and Pp; in the 


Flow Plane for Grade i. 


K 
O r 1 Ben 1 
| 2) d;(1},0}) = L (b 1; +8 


O 
ae apa ae 


K 
"i “NY O 
l EE E S Yk: 


Bu cd (ml on) 
2 1 1 ï 


bet, 


o 
2 
O 
< 
20 
Il 
~ 
o 


(20) 





We must show for 0 <@< 1, and 


Pee GO) + (1-6) (1, 08 ), (21) 


that a(r., 0.) =4a.(I., 0.; y 
1 l l l 


By substituting (21) into (20) we find, 


K 
. O, _ 1 ES 1 => n O 
at On ve = ey ea tt 6)Ii'] + 8,,[80, + (1-6) 0! J)v;, 


ag O 1 1 = O ti 


Because the dual optimal solution implies a basis for the 
primal optimal solution, it is readily verified that the sets 
KÍ and K. cannot change unless the dual optimal solution 
changes. 

Let x; solve the primal retraining transportation problem 
(PRTP) at (I!, 0!), and let x!' solve PRTP at (I!, 0!). Then 

a a a 1 i 
because the optimal values of the primal and đual linear 
programs are equal, we have, 

1 => O 1 y 
) ) ey (kem)xi (kym) = ay (Ty, 03), (22) 
keK. mek. 
1 a 


) o 
$ ) ea) =ra.il., 03). (23) 
keK, meK, > z ee = 


— yt a: 11 + r 
x, (k,m) = Ox: (k,m) + (1 0) x; (km), kek. , mek, . 


By hypothesis {xi (k,m) J and {xi(k,m) } are feasible in 


PRTP. Thus 
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em) ) (Oxi (kım) + (1-8) x!" (Kum)), KOR: 


MEK MEK. 
1 


= 8) ach, N, + (1-0) ) ee 


meK. meK, 
= t t Br 1 tt 
Or. (1, 0.) + (1 O)rs, (1s, 0:9). 


Using equations (19) and (21) we have, 


= + 
mex; 


Similarly it can be verified that 


A O ime | 
kek; 1 rea 1 1 


Furthermore, x. (k,m) > 0, XxX (KR) > O and O < 0 < 1 applies 
i 


that 
0x' (k,m) + (1-8)x"(k,m) > 0, 
l a 


mee, x. (k,m) > 0. We note that this is the only part of 
the proof that uses the condition 0 < 0 < 1. 
Thus we have shown that X; (k,m) is feasible in PRTP. 


Using (20), (22) and (23) we find that 


c. (k,m)x. (km) = 06d (1* , O') + (1-0) (1, O" 
DR pe x (Kom) x; (Kym) = 04 (17, 01) + (1-0)a; (Tj, Oj) 
T i 


=4,11,, 0, v2). 
So we have found a primal feasible solution {x, (k,m) } and a 
dual feasible solution ve and both give the same value of 
the objective function, so they are optimal solutions. In 


particular, vo is the dual optimal solution at (I;, 0,) as 


was to be shown. C] 
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If solution of the retraining transportation at two 
points in the flow plane results in the same dual optimal 
solution ee then the proof of Lemma 4 enables us to 
determine the extent of the cone in the flow plane in which 
the dual optimal solution does not change. 

Corollary 5. If two points in the flow plane, (I. , 0.) 
and (1, On), have solutions to PRTP, x; and xs respectively, 
and have the same solution to DRTP, Ae then for any 9 such 


that 


-8)x!' > 0 
Ox: + (1 9) xi >» 
the solutions to PRTP and DRTP at 


are, 
Ox! + (1-0)x" , 
i i 


and, 


O 
V. 
1 


respectively. 
Proof. Follows from the proof of Lemma 5. C] 

The foregoing results enable us to describe the optimal 
retraining costs as a function of the total flows into and 
out of grade i. 

Theorem 6. The optimal retraining cost in grade i is a 
piecewise linear convex function of the total flows into and 
out of grade i. 

Proof. The piecewise linearity of tħ optimal retraining 


we 


costs follows from equation (19), Lemma 4 and Corollary 5. 


239 





To show the convexity of the optimal costs, it will 
be convenient to use the original retraining cost minimization 


program developed in Section F. 


K K 
Pl) min » 7 C¿ (k,m)x, (k,m) 
k=1 m=1 
K 
ST e sg om) IZ O 


x, (k,m) > 0. 
1 tt 
Suppose x; solves Pl at (I., 0:), and x; solves Pl at 


CL; 0.). Then, 


L 
a(r} 0%) = 5 5 c. (k,m)x. (k,m), (25) 
I 1 1 k=] m=1 E l1 
and 
a = 5 5 Cc. (k,m)x. (k,m). iae 
I A k=1 m=1 1 T 
Let, 
x, = Ox. + (1-6) x. 


Then x; > 0, and, 


K K 1 1 iT ve 
), (Xx; (k,m)-x, (m,k)) ) [6 (x, (k,m)-x, (m,k) + (1-8) (x; (kım) — x; (m,k))] 


mel nel 
=0 ) (x, (k,m)-x, (m,k)) + (1-0) ) (x, (k,m)-x, (m,k)) 
mel z nel 


u A a0) 


Li lr 0;). 
So x, is a feasible solution to Pl at (I; 0.). The value 


of the objective function with this solution is, 


140 





K K K K t 5 
L 2 cj(k,m)x; (km) = )  )c¿(<,m10x,(k,m) + (1-0)x, (x,m)] 
k=1 mel ol 


N 


o.” ' o," 
Od. (I,, 05) + (1-0)d; (1; + 0;) 


from (25) and (26). 


But the minimal value of the objective function can be 
no greater than the value resulting from the feasible 


solution Xe, LAB, 
„ 


O O ' 1 O 1 
O) BEA A (ena (E 0 am 


We use the assumption that 0. = I jel,-.. ,n-1 to 


i-i’ 
combine the flow planes. 
Let us define an (n+1)-dimensional "flow space" as the 
E n+l . tala 
nonnegative orthant of E , and let the ı component of any 


point in this space be the total flow into grade 1, i=l,°*:°-,n, 


and the CE component is identically zero. Let, 


n 
O once a O 
AT eee) È dijs Ij? [27] 
ı=1 
where I 41 = 0. 
Theorem 7. The function ee Ls) is convex. It is 


linear on cones in which the optimal dual solutions do not 


change. 


Proof. The sum of convex functions 1S a convex function, so 


Theorem 6 and the definition of Augen >: St ) indicate it 


n+l 


is convex. 
We have from Corollary € that in each grade i, the 


optimal retraining cost gr I 


i ) is linear in any region 


iL 


in which the dual optimal solution v5 does not change. 
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Theorem 3 and Lemma 4 indicate that such regions are two- 
dimensional cones. The intersection of .such two-dimensional 
cones in the (n+1)-dimensional flow space defines a cone in 
pnt. The sum of linear functions is linear, so the latter 
part of the theorem follows. C] 
It has been shown in this section that under the 
assumption that nal flows are restricted to the lowest 
grade, the retraining costs are a convex O of the 
total flows of people between grades. Consequently, ina 
cost minimizing scheme such as that described in Section H, 
any local optimum is also a global optimum provided the set 


of solutions considered feasible is a convex set in the flow 


space. 
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APPENDIX A. SUMMARY OF NOTATION 


For the most part scalars and vectors are denoted by 
lower case letters, and matrices are denoted by upper case 
letters. There are exceptions. 

The subscript i is used to denote the first character- 
istic, grade. The subscript j is used to denote the second 
characteristic. 

A matrix of zeroes is denoted by 0. A vector of zeroes 
is denoted by 0. A vector of ones is denoted by 1. The 
dimensions of e O and 1 are implied by the context in which 
they are used. For any vector x we use xl to denote the sum 
of the components of x. 

Unless otherwise indicated the empty sum and the empty 


product are defined as identity elements for the correspond- 


ing operator. For example,, if B is a matrix then, 


i-l m 

) B, = 0, the additive identity matrix, when k>1, 
m=k 
i-1 

JI Bn = I, the multiplicative identity matrix, when K>i. 
m=k 


When a matrix is displayed in terms of its submatrices, 


the zero submatrices may not be shown explicitly. For 
example, 
oer i ce Pie 
Q = Dee 22 a 
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For any square matrix A we use A to denote the matrix 


dg 
which has the same elements as A on its main diagonal but 
has zeroes elsewhere. For any matrix B we use Bsq to denote 
the matrix which has the square of the elements of B as 
its elements. 

The matrix P; and the vector s; (t) are defined för 
1=1,°°*,n. For convenience we may use the convention P.=0 


and s,(t) = 0 for i<1ı. 


If x and y are two vectors such that 


(x ...,XxX_) 


X 
fee m 


and, 

Y 
then we define the congruent multiplication operator, €, 
by, 

a = EE en) 
Congruent multiplication is discussed in some detail in 
R. A. Howard, 1970. 

A "distribution vector" is a nonnegative vector having 
components that sum to one. Row vectors are enclosed in 
parentheses. Column vectors are enclosed in brackets. 


The symbol a denotes the end of a proof. 
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APPENDIX B. DIAGONAL MATRICES 


Let A be a matrix, not necessarily square, and let 
A (k,j) be the element of A in row i and column j. We say 


that A is a diagonal matrix if for some integer K A 5) 


a 
# 0 implies that j-i equals Ky: We call Ka the index of the 
diagonal matrix A. 

The key cell of the diagonal matrix A is defined as: 

(1) A (1,1) IE Ka = 0, 

(2) A (1,K, + 1) if Ka > 0, 


(3) A (1-k 1) ri Ka < 0. 


A’ 


We will use the notation (i ) for the row and column of 


A‘ IA 
the key cell of A. Note that Ja = in = Ko > 

If A is a zero matrix, then it is a diagonal matrix, and 
the value of its index is arbitrary. If A is a non-zero 
diagonal matrix, then its index is unique and its key cell 


is well defined. 


Let A be a non-zero mxn diagonal matrix, and let N be 


maximum number of non-zero elements in A. Then it is readily 


verified that 


z, = min {m + min (0, Kt, n - max {0, Ko +). 


All non-zero elements of A are in the set: 


+r); re 0,1,°°°,2, - 1}. 


This set is called the non-zero diagonal of A. 


Der A 


The purpose of this appendix is to show that diagonal 


matrices can be stored and multiplied in compacted form. 
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If A is an m x n diagonal matrix, then the compacted form 


* 
of A is a vector A where, 


* 
A (1) = m, 


* 
A (2) =n, 


* 


A (3) = Kar 


* 
A (4) = 


Zane 
and the fifth through last elements of A. see the elements of 
the non-zero diagonal of A. 

A zero matrix A may be stored as a four-component 


k 
vector A: 


* 
A (1) 


tl 
= 


* 
A (2) =n 


* * 
ALIS) RO AS) s m 


* 
AO 


We will show that the product of diagonal matrices is 
a diagonal matrix, and that the product is efficiently 
computed using the compacted forms of the multiplicand and 


product matrices. 


Lemma 1. If A and B are diagonal matrices such that C = AB 
is defined, then C is a diagonal matrix. Furthermore the 
index of C is the sum of the indices of A and B. 
Proof. If C is a zero matrix, then it is a diagonal matrix 
and the value of its index is arbitrary. 

If C is not a zero matrix, then for some row i and column 
Jr 

Zu, 570. 
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We note that C(i, Jj) is the inner product of the ¡En row of 


A and the ae column of B, and that each of the rows and 
columns of A and B contain at most one non-zero element. 


Consequently, there must exist nonnegative integers r, and 


A 
rp such that: 
C(i, j) = Ali, eG In = ra) Blip ON I + En) 
i = i, + 
J = dip t ra 
IN r, = in er: 


Starting with the last equation 


ee Ze: 
J = Jp + Ja + La = ln 
=: = Jp + In + lp = La 


ka + kp- |] 


Continuing with notation used above, we see that each 


element of the non-zero diagonal of C is the product of 
elements of the non-zero diagonals of A and B. We consider 
next how the elements of the non-zero diagonals of A and B 
are paired to form the non-zero diagonal of C. 

From the proof of the lemma, if C (1,3) # 0, then for 
some non-negative integers r, and r.: 


A B 


(1) C(1,3) = Ali, + ry, Ja + ra) Blig + rps Íg + Tp) 


(2) 1l = i, +T 
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B 
(4) tr, 
To find the relation between En and rp, we note that 
(J, + rp) - (i, + ra) = js 2 
= Ka + Kar 
so, 
Er - La = Ka + Ko - Ip ~ la 
= (Ko = jp) + (i, + ky) 
But, 
in + Ka = 1 + Ka Ir Ka > 0 
= ] if Ka er, 
and, 
kp -= Ja = -] If Kp ze) 
= Ka -] rI Ky <,0. 
Thus, 
— + 
En -rn = Ka + Ka 
+ 5 
where Ka = Ka IE Ka > 0 
= 0 otherwise, 
5 = ] < 
Ko Ko if Kp 0 
= 0 otherwise. 
Let d,. = ki + kt 
AB `B A” 
a er = E 
LL Ko t Ka = Anp 20 then r, = r, + da 2 10, Hama 
the first element of the non-zero diagonal of A is paired with 
st 


the (din es) element of the non-zero diagonal of B. 
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+ 


2) If kp + Ka = dam 0 then Ip = rR T dag? 0, and the 
first element of the non-zero diagonal of B is paired with 
the (la, element of the non-zero diagonal of A. 


Consider the case 


ra = “dap 
ee: 
PB dap 
Then, 
£ = 4° E 
Do ta SAB AB AB ' 
and, 
eo a” + 
1 = 41, ~ Gap 
=- 


J = Jp Y Ue 


If Ka > 0 or k 


LA 


0, then, 


1) àa SOS < 0 


AB B 
-> Jp = 1 
> 
2) A 0 > Ka 0 
=> In = ip 
+ 1 = 1 
3) dap = 0 > Ky = Ko = 0 
->+ 1 = j = 
So if Ka > 0 or ko SO eetctme mercer Os 3 eguals l in the 
equation 
TR- T 
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That is, C(i,j) is the key element of C. 


If Ky < 0 and Kp > 0, then dan = Q and 


Sg) = C(1 ) = C (l-ka? 1+k,), 


ar? 
and the first min {-k,, a elements of the non-zero diagonal 
of C are zeroes. 

The foregoing observations lead to the following algorithm 


for multiplying two diagonal matrices. 


- + 
l) Compute dam = Kp + KA 


'2) Drop the first dine elements of thé non-zero diagonal 
of A and store the remaining elements in a vector VA. 

3) ioe the first a elements of the non-zero diagonal 
of B and store the remaining elements in a vector VB. 

4) Drop elements from the end of the longer of the two 
vectors VA and VB until they are of equal length. Multiply 
corresponding components of these vectors to form a new vector 
NC. 

5) Append max {0, min ck) zera in front of Ne. 


Compute zZ the length of the non-zero diagonal of C, and 


es 
append zeroes to the end of VC as necessary to form the non- 
zero Giagonal of C. 

6) The first four elements of the compacted form of C 
are A(1l1), B(2), Ky + Kpr Zo: 


non-zero diagonal of C as computed in 5). 


The remaining elements are the 
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